Sainbayar Sukhbaatar
FAIR team, Meta AI
Verified email at fb.com
Title
Cited by
Year
End-To-End Memory Networks
S Sukhbaatar, A Szlam, J Weston, R Fergus
Cited by: 3368; Year: 2015
Learning multiagent communication with backpropagation
S Sukhbaatar, A Szlam, R Fergus
Advances in Neural Information Processing Systems, 2244-2252, 2016
Cited by: 1459; Year: 2016
Training Convolutional Networks with Noisy Labels
S Sukhbaatar, J Bruna, M Paluri, L Bourdev, R Fergus
Accepted as a workshop contribution at ICLR 2015, 2014
Cited by: 1017*; Year: 2014
Intrinsic motivation and automatic curricula via asymmetric self-play
S Sukhbaatar, Z Lin, I Kostrikov, G Synnaeve, A Szlam, R Fergus
arXiv preprint arXiv:1703.05407, 2017
Cited by: 455; Year: 2017
Simple baseline for visual question answering
B Zhou, Y Tian, S Sukhbaatar, A Szlam, R Fergus
arXiv preprint arXiv:1512.02167, 2015
Cited by: 432; Year: 2015
Learning when to communicate at scale in multiagent cooperative and competitive tasks
A Singh, T Jain, S Sukhbaatar
arXiv preprint arXiv:1812.09755, 2018
Cited by: 366; Year: 2018
Adaptive attention span in transformers
S Sukhbaatar, E Grave, P Bojanowski, A Joulin
arXiv preprint arXiv:1905.07799, 2019
Cited by: 333; Year: 2019
Self-rewarding language models
W Yuan, RY Pang, K Cho, S Sukhbaatar, J Xu, J Weston
arXiv preprint arXiv:2401.10020, 2024
Cited by: 311; Year: 2024
Hash layers for large sparse models
S Roller, S Sukhbaatar, J Weston
Advances in Neural Information Processing Systems 34, 17555-17566, 2021
Cited by: 196; Year: 2021
Augmenting self-attention with persistent memory
S Sukhbaatar, E Grave, G Lample, H Jegou, A Joulin
arXiv preprint arXiv:1907.01470, 2019
Cited by: 130; Year: 2019
Composable planning with attributes
A Zhang, S Sukhbaatar, A Lerer, A Szlam, R Fergus
International Conference on Machine Learning, 5842-5851, 2018
Cited by: 85; Year: 2018
Mazebase: A sandbox for learning from games
S Sukhbaatar, A Szlam, G Synnaeve, S Chintala, R Fergus
arXiv preprint arXiv:1511.07401, 2015
Cited by: 83; Year: 2015
Iterative reasoning preference optimization
RY Pang, W Yuan, K Cho, H He, S Sukhbaatar, J Weston
arXiv preprint arXiv:2404.19733, 2024
Cited by: 82; Year: 2024
Addressing Some Limitations of Transformers with Feedback Memory
A Fan, T Lavril, E Grave, A Joulin, S Sukhbaatar
arXiv preprint arXiv:2002.09402, 2020
Cited by: 80*; Year: 2020
Memory-augmented reinforcement learning for image-goal navigation
L Mezghan, S Sukhbaatar, T Lavril, O Maksymets, D Batra, P Bojanowski, ...
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022
Cited by: 73; Year: 2022
Learning goal embeddings via self-play for hierarchical reinforcement learning
S Sukhbaatar, E Denton, A Szlam, R Fergus
arXiv preprint arXiv:1811.09083, 2018
Cited by: 67; Year: 2018
Some things are more cringe than others: Preference optimization with the pairwise cringe loss
J Xu, A Lee, S Sukhbaatar, J Weston
arXiv preprint arXiv:2312.16682, 2023
Cited by: 61; Year: 2023
System 2 Attention (is something you might need too)
J Weston, S Sukhbaatar
arXiv preprint arXiv:2311.11829, 2023
Cited by: 56; Year: 2023
End-to-end memory networks
JE Weston, AD Szlam, RD Fergus, S Sukhbaatar
US Patent 10,664,744, 2020
Cited by: 46; Year: 2020
Teaching large language models to reason with reinforcement learning
A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ...
arXiv preprint arXiv:2403.04642, 2024
Cited by: 45; Year: 2024