new

Get trending papers in your email inbox!

Subscribe

Daily Papers

byAK and the research community

Dec 12

The Validity of Evaluation Results: Assessing Concurrence Across Compositionality Benchmarks

NLP models have progressed drastically in recent years, according to numerous datasets proposed to evaluate performance. Questions remain, however, about how particular dataset design choices may impact the conclusions we draw about model capabilities. In this work, we investigate this question in the domain of compositional generalization. We examine the performance of six modeling approaches across 4 datasets, split according to 8 compositional splitting strategies, ranking models by 18 compositional generalization splits in total. Our results show that: i) the datasets, although all designed to evaluate compositional generalization, rank modeling approaches differently; ii) datasets generated by humans align better with each other than they with synthetic datasets, or than synthetic datasets among themselves; iii) generally, whether datasets are sampled from the same source is more predictive of the resulting model ranking than whether they maintain the same interpretation of compositionality; and iv) which lexical items are used in the data can strongly impact conclusions. Overall, our results demonstrate that much work remains to be done when it comes to assessing whether popular evaluation datasets measure what they intend to measure, and suggest that elucidating more rigorous standards for establishing the validity of evaluation sets could benefit the field.

  • 3 authors
·
Oct 26, 2023

On the Electron Pairing Mechanism of Copper-Oxide High Temperature Superconductivity

The elementary CuO2 plane sustaining cuprate high-temperature superconductivity occurs typically at the base of a periodic array of edge-sharing CuO5 pyramids. Virtual transitions of electrons between adjacent planar Cu and O atoms, occurring at a rate t/{hbar} and across the charge-transfer energy gap E, generate 'superexchange' spin-spin interactions of energy Japprox4t^4/E^3 in an antiferromagnetic correlated-insulator state. However, Hole doping the CuO2 plane converts this into a very high temperature superconducting state whose electron-pairing is exceptional. A leading proposal for the mechanism of this intense electron-pairing is that, while hole doping destroys magnetic order it preserves pair-forming superexchange interactions governed by the charge-transfer energy scale E. To explore this hypothesis directly at atomic-scale, we combine single-electron and electron-pair (Josephson) scanning tunneling microscopy to visualize the interplay of E and the electron-pair density nP in {Bi_2Sr_2CaCu_2O_{8+x}}. The responses of both E and nP to alterations in the distance {\delta} between planar Cu and apical O atoms are then determined. These data reveal the empirical crux of strongly correlated superconductivity in CuO2, the response of the electron-pair condensate to varying the charge transfer energy. Concurrence of predictions from strong-correlation theory for hole-doped charge-transfer insulators with these observations, indicates that charge-transfer superexchange is the electron-pairing mechanism of superconductive {Bi_2Sr_2CaCu_2O_{8+x}}.

  • 9 authors
·
Aug 8, 2021