{"id":4411,"date":"2025-10-23T13:24:48","date_gmt":"2025-10-23T17:24:48","guid":{"rendered":"https:\/\/oge.mit.edu\/msrp\/?post_type=profiles&#038;p=4411"},"modified":"2025-12-15T11:13:33","modified_gmt":"2025-12-15T16:13:33","slug":"amiri-hayes","status":"publish","type":"profiles","link":"https:\/\/oge.mit.edu\/msrp\/profiles\/amiri-hayes\/","title":{"rendered":"Amiri Hayes"},"content":{"rendered":"<div class=\"wp-block-image\">\n<figure class=\"alignleft size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"2560\" height=\"2560\" src=\"https:\/\/oge.mit.edu\/msrp\/wp-content\/uploads\/sites\/2\/2025\/10\/HayesAmiri-edited-scaled.jpg\" alt=\"\" class=\"wp-image-4412\" style=\"width:200px;height:auto\" srcset=\"https:\/\/oge.mit.edu\/msrp\/wp-content\/uploads\/sites\/2\/2025\/10\/HayesAmiri-edited-scaled.jpg 2560w, https:\/\/oge.mit.edu\/msrp\/wp-content\/uploads\/sites\/2\/2025\/10\/HayesAmiri-edited-300x300.jpg 300w, https:\/\/oge.mit.edu\/msrp\/wp-content\/uploads\/sites\/2\/2025\/10\/HayesAmiri-edited-1024x1024.jpg 1024w, https:\/\/oge.mit.edu\/msrp\/wp-content\/uploads\/sites\/2\/2025\/10\/HayesAmiri-edited-150x150.jpg 150w, https:\/\/oge.mit.edu\/msrp\/wp-content\/uploads\/sites\/2\/2025\/10\/HayesAmiri-edited-768x768.jpg 768w, https:\/\/oge.mit.edu\/msrp\/wp-content\/uploads\/sites\/2\/2025\/10\/HayesAmiri-edited-1536x1536.jpg 1536w, https:\/\/oge.mit.edu\/msrp\/wp-content\/uploads\/sites\/2\/2025\/10\/HayesAmiri-edited-2048x2048.jpg 2048w\" sizes=\"auto, (max-width: 2560px) 100vw, 2560px\" \/><\/figure>\n<\/div>\n\n\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-constrained wp-block-group-is-layout-constrained\">\n<p><strong>MIT Department:<\/strong> Electrical Engineering and Computer Science<br><strong>Faculty Mentor<\/strong>: Prof. Jacob Andreas<br><strong>Research Supervisor:<\/strong> Belinda Li<br><strong>Undergraduate Institution:<\/strong> New Jersey Institute of Technology<br><strong>Website<\/strong>:<\/p>\n<\/div><\/div>\n\n\n\n<div style=\"height:0px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>Biography<\/strong><\/h4>\n\n\n\n<p>Amiri Hayes is an incoming senior in the Honors College at New Jersey Institute of Technology pursuing a Bachelor&#8217;s Degree in Applied Mathematics alongside a dual Master\u2019s in Artificial Intelligence. Before matriculating to NJIT, he was a homeschooled student who earned three associate degrees in Physics, Mathematics, and Computer Science at Rowan College of South Jersey &#8211; Gloucester alongside his high school diploma. Since then, he has held a software engineering co-op position at UPS as well as a summer position as aMathematics Student Researcher at the Institute for Pure and Applied Mathematics at UCLA. His other interests involve investing and computational social science, which have led to his involvement in co-founding an Investment Club and conducting independent research in automated transportation infrastructure evaluations. Amiri aspires to attend graduate school to improve his ability to use mathematics and statistics to model complex systems.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>Abstract<\/strong><\/h4>\n\n\n\n<p class=\"has-text-align-center\"><strong>Filtering Attention Heads through Automable Interpretability Experiments<\/strong><\/p>\n\n\n\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-constrained wp-block-group-is-layout-constrained\">\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-constrained wp-block-group-is-layout-constrained\">\n<div class=\"wp-block-group is-vertical is-content-justification-center is-nowrap is-layout-flex wp-container-core-group-is-layout-73832be3 wp-block-group-is-layout-flex\">\n<p class=\"has-text-align-center\"><strong>Amiri Hayes<sup>1<\/sup>, Jacob Andreas<sup>2<\/sup>, and Belinda Li<sup>2<\/sup><\/strong><\/p>\n\n\n\n<div class=\"wp-block-group is-vertical is-content-justification-center is-layout-flex wp-container-core-group-is-layout-4b2eccd6 wp-block-group-is-layout-flex\">\n<p class=\"has-text-align-center\"><sup>1<\/sup>Department of Mathematical Sciences, New Jersey Institute of Technology<\/p>\n\n\n\n<p class=\"has-text-align-center\"><sup>2<\/sup>Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology<\/p>\n<\/div>\n<\/div>\n<\/div><\/div>\n<\/div><\/div>\n\n\n\n<p class=\"has-text-align-center\"><\/p>\n\n\n\n<p>Transformer-based large language models (LLMs) like BERT and GPT have transformed natural language processing, yet their internal mechanisms remain opaque. To improve interpretability, we focus on understanding the function of attention heads, which are learned components that direct focus across input sequences. Prior work shows that some heads consistently track syntactic or semantic relationships, suggesting interpretable structure. We propose an automated, generalizable method for describing attention heads in human-interpretable terms using program synthesis: we associate each head with a symbolic program that specifies how the head might operate. Such programs exist for a variety of phenomena and serve as hypotheses which can then be tested against actual attention activations by computing distance metrics. Additionally, we explore whether LLMs can aid in the process of constructing these programs by predicting and programmatically testing their own hypothesis about head functions. By analyzing attention behavior across layers, models, and datasets, we assess which functions are stable and generalizable. Our findings suggest that many attention heads exhibit consistent, interpretable behavior, and that program-driven analysis can effectively reveal roles of specific attention mechanisms. This project contributes a framework for reverse-engineering attention functions, helping to bridge the gap between black-box model architecture and human linguistic understanding.<\/p>\n\n\n\n<p><\/p>\n","protected":false},"featured_media":4412,"template":"","profile_category":[23],"class_list":["post-4411","profiles","type-profiles","status-publish","has-post-thumbnail","hentry","profile_category-2025-interns"],"acf":[],"_links":{"self":[{"href":"https:\/\/oge.mit.edu\/msrp\/wp-json\/wp\/v2\/profiles\/4411","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/oge.mit.edu\/msrp\/wp-json\/wp\/v2\/profiles"}],"about":[{"href":"https:\/\/oge.mit.edu\/msrp\/wp-json\/wp\/v2\/types\/profiles"}],"version-history":[{"count":2,"href":"https:\/\/oge.mit.edu\/msrp\/wp-json\/wp\/v2\/profiles\/4411\/revisions"}],"predecessor-version":[{"id":4807,"href":"https:\/\/oge.mit.edu\/msrp\/wp-json\/wp\/v2\/profiles\/4411\/revisions\/4807"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/oge.mit.edu\/msrp\/wp-json\/wp\/v2\/media\/4412"}],"wp:attachment":[{"href":"https:\/\/oge.mit.edu\/msrp\/wp-json\/wp\/v2\/media?parent=4411"}],"wp:term":[{"taxonomy":"profile_category","embeddable":true,"href":"https:\/\/oge.mit.edu\/msrp\/wp-json\/wp\/v2\/profile_category?post=4411"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}