MultiMon | Notion

Complexity of Integration: Multimodal systems must handle the integration of different data types, each with its own inherent complexities and subtleties. Merging these into a coherent system amplifies the chance of unexpected behaviors.

Spurious Correlations: As noted earlier with CLIP, exploiting spurious correlations can lead to increases in accuracy on specific distributions, but it may result in failures when faced with distribution shifts or unseen examples.

Erroneous Agreement: As mentioned in the abstract, MULTIMON identifies systematic failures by looking for erroneous agreement (inputs that produce the same output but should not). This can happen in multimodal systems when different types of input data lead to similar representations in the shared embedding space, causing the system to mistakenly treat them as equivalent.

Transfer Learning Vulnerabilities: Multimodal systems often leverage pre-trained models and adapt them to new tasks. While this approach is powerful, it might also inherit vulnerabilities and biases from the original training data, leading to unforeseen failures in the new context.

Relationship between tasks (transfer learning), distribution, data types, correlations

Dependency on Distribution: Different tasks may require models to adapt to specific distributions. For example, a speech recognition model trained on a particular accent may perform poorly on another accent. Fine-tuning can help adapt the model to the new distribution.

MultiMon

analyzing a corpus (a large collection of text, images, or other data) to identify instances where the model makes an incorrect prediction or erroneous agreement. "Scraping" refers to the process of automatically extracting information from the data, in this case, the specific inputs and outputs where the model went wrong.
Categorize systematic failure. we find commonalities and similarities in representation, mostly CLIP
1. Specifically, we use GPT-4 to identify systematic failures: generalizable natural-language descriptions of patterns of failures, from the scraped individual failures.
Then flexibly generating novel instances: trigger same type of failure.

14 system failures

encode negation, spatial differences, numerical differences, role ambiguity, quantifiers, and more