Kolbeinsson, Arinbjörn

11 publications

ICLR 2026 Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces Mike A Merrill, Alexander Glenn Shaw, Nicholas Carlini, Boxuan Li, Harsh Raj, Ivan Bercovich, Lin Shi, Jeong Yeon Shin, Thomas Walshe, E. Kelly Buchanan, Junhong Shen, Guanghao Ye, Haowei Lin, Jason Poulos, Maoyu Wang, Marianna Nezhurina, Di Lu, Orfeas Menis Mastromichalakis, Zhiwei Xu, Zizhao Chen, Yue Liu, Robert Zhang, Leon Liangyu Chen, Anurag Kashyap, Jan-Lucas Uslu, Jeffrey Li, Jianbo Wu, Minghao Yan, Song Bian, Vedang Sharma, Ke Sun, Steven Dillmann, Akshay Anand, Andrew Lanpouthakoun, Bardia Koopah, Changran Hu, Etash Kumar Guha, Gabriel H. S. Dreiman, Jiacheng Zhu, Karl Krauth, Li Zhong, Niklas Muennighoff, Robert Kwesi Amanfu, Shangyin Tan, Shreyas Pimpalgaonkar, Tushar Aggarwal, Xiangning Lin, Xin Lan, Xuandong Zhao, Yiqing Liang, Yuanli Wang, Zilong Wang, Changzhi Zhou, David Heineman, Hange Liu, Harsh Trivedi, John Yang, Junhong Lin, Manish Shetty, Michael Yang, Nabil Omi, Negin Raoof, Shanda Li, Terry Yue Zhuo, Wuwei Lin, Yiwei Dai, Yuxin Wang, Wenhao Chai, Shang Zhou, Dariush Wahdany, Ziyu She, Jiaming Hu, Zhikang Dong, Yuxuan Zhu, Sasha Cui, Ahson Saiyed, Arinbjörn Kolbeinsson, Christopher Michael Rytting, Ryan Marten, Yixin Wang, Jenia Jitsev, Alex Dimakis, Andy Konwinski, Ludwig Schmidt
ICLR 2025 Composable Interventions for Language Models Arinbjörn Kolbeinsson, Kyle O'Brien, Tianjin Huang, Shanghua Gao, Shiwei Liu, Jonathan Richard Schwarz, Anurag Jayant Vaidya, Faisal Mahmood, Marinka Zitnik, Tianlong Chen, Thomas Hartvigsen
NeurIPSW 2024 Adversarial Negotiation Dynamics in Generative Language Models Arinbjörn Kolbeinsson, Benedikt Kolbeinsson
ICLRW 2024 Composing Knowledge and Compression Interventions for Language Models Arinbjörn Kolbeinsson, Tianjin Huang, Shanghua Gao, Shiwei Liu, Jonathan Richard Schwarz, Anurag Jayant Vaidya, Faisal Mahmood, Marinka Zitnik, Tianlong Chen, Thomas Hartvigsen
NeurIPSW 2024 Position: Classifying GenAI Under the European Union’s Medical Device Regulation Benedikt Kolbeinsson, Arinbjörn Kolbeinsson
NeurIPSW 2024 Position: Transparent Reporting for Healthcare GenAI Arinbjörn Kolbeinsson, Benedikt Kolbeinsson
NeurIPSW 2023 Generative Models for Wearables Data Arinbjörn Kolbeinsson, Luca Foschini
CHIL 2023 Homekit2020: A Benchmark for Time Series Classification on a Large Mobile Sensing Dataset with Laboratory Tested Ground Truth of Influenza Infections Mike A Merrill, Esteban Safranchik, Arinbjörn Kolbeinsson, Piyusha Gade, Ernesto Ramirez, Ludwig Schmidt, Luca Foschini, Tim Althoff
ICLRW 2023 Stasis: Reinforcement Learning Simulators for Human-Centric Real-World Environments Georgios Efstathiadis, Patrick Emedom-Nnamdi, Arinbjörn Kolbeinsson, Jukka-Pekka Onnela, Junwei Lu
AAAI 2021 PenDer: Incorporating Shape Constraints via Penalized Derivatives Akhil Gupta, Lavanya Marla, Ruoyu Sun, Naman Shukla, Arinbjörn Kolbeinsson
JMLR 2020 Tensor Regression Networks Jean Kossaifi, Zachary C. Lipton, Arinbjorn Kolbeinsson, Aran Khanna, Tommaso Furlanello, Anima Anandkumar