ZMIC Journal Club

Dataset Bias

Different from social and stereotypical bias. This mostly concerns the proper coverage of concepts and objects, or in other words, how representative the dataset is for the real world.

Torralba & Efros (2011) presented the dataset classification problem and examined dataset bias in the context of hand-crafted features with SVM classifiers.
Tommasi et al. (2015) studied the dataset classification problem using neural networks.
The concept of classifying different datasets has been further developed in domain adaption methods (Tzeng et al., 2014; Ganin et al., 2016). (adversarially learning)

Battle on Dataset Bias

Paper Info

Table Of Content

Task: Name The Dataset

Answer

User Study 1

User Study 2

Dataset Bias

Torralba & Efros (2011)

Dataset used

Main Observation: 84.7% acc by NN

Observations

Low-level signatures?

Corruptions to suppress signatures

Corruptions' Results

Memorization or Generalization?

Self-supervised learning?

Transfer learning?

Cross-Dataset Generalization

ARE WE THERE YET? -- NO

Inspiration