Akari Asai

2,008 posts
Akari Asai
@AkariAsai
Incoming Assistant Professor (Hiring Ph.D. students for Fall 2026) & research scientist, OLMo. akariasai @ 🦋

Akari Asai’s posts

Pinned
1/ Hiring PhD students at CMU SCS (LTI/MLD) for Fall 2026 (Deadline 12/10) 🎓 I work on open, reliable LMs: augmented LMs & agents (RAG, tool use, deep research), safety (hallucinations, copyright), and AI for science, code & multilinguality & open to bold new ideas! FAQ in 🧵
1/ Introducing ᴏᴘᴇɴꜱᴄʜᴏʟᴀʀ: a retrieval-augmented LM to help scientists synthesize knowledge 📚 With open models & 45M-paper datastores, it outperforms proprietary systems & matches human experts. Try out our demo! We also introduce ꜱᴄʜᴏʟᴀʀQᴀʙᴇɴᴄʜ,
🚨 I’m on the job market this year! 🚨 I’m completing my Ph.D. (2025), where I identify and tackle key LLM limitations like hallucinations by developing new models—Retrieval-Augmented LMs—to build more reliable real-world AI systems. Learn more in the thread! 🧵
Overview of Akari's research. More information is at https://akariasai.github.io/
New paper 🚨 arxiv.org/abs/2211.09260 Can we train a single search system that satisfies our diverse information needs? We present 𝕋𝔸ℝ𝕋 🥧 the first multi-task instruction-following retriever, trained on 𝔹𝔼ℝℝ𝕀 🫐, a collection of 40 retrieval tasks with instructions! 1/N
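The core idea of an instruction-following retriever is that the task instruction is combined with the query, so one retriever can serve many tasks. A minimal sketch of that input format — `format_query` is a hypothetical helper, and TART's actual input format may differ:

```python
def format_query(instruction: str, query: str) -> str:
    """Prepend a task instruction to a query before retrieval.

    A separator token keeps the instruction and query distinct;
    the [SEP] convention here is an assumption, not TART's exact format.
    """
    return f"{instruction} [SEP] {query}"


# The same query under different instructions yields different retrieval inputs,
# which is what lets a single retriever cover many tasks.
q = "who wrote Hamlet?"
qa_input = format_query("Retrieve a Wikipedia paragraph that answers this question.", q)
dup_input = format_query("Find a question that is a duplicate of this one.", q)
```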
(Jumping on the bandwagon...) I entered the University of Tokyo in the humanities track and initially went into the Faculty of Economics, but I ended up graduating from the Department of Electronic and Information Engineering in the Faculty of Engineering, and now I do NLP/machine learning research in a CS Ph.D. program in the US. I sometimes wish I had started programming earlier (I had no experience until I was 20), but it's fun 😀
Quote
五十嵐祐花
@00_
This is really belated, but it's almost laughable how reckless it was for someone who has only ever had below-average math talent (I couldn't even fully solve more than two problems on the UTokyo math exam) and who, until my second year of high school, thought I'd go the humanities route, to end up in a Ph.D. program at MIT. How did this happen....?
𝗛𝗼𝘄 𝗰𝗮𝗻 𝘄𝗲 𝗯𝘂𝗶𝗹𝗱 𝗺𝗼𝗿𝗲 𝗿𝗲𝗹𝗶𝗮𝗯𝗹𝗲 𝗟𝗠-𝗯𝗮𝘀𝗲𝗱 𝘀𝘆𝘀𝘁𝗲𝗺𝘀? Our new position paper advocates for retrieval-augmented LMs (RALMs) as the next gen. of LMs, exploring the promises, limitations, and a roadmap for wider adoption. arxiv.org/abs/2403.03187 🧵
New paper 🚨 Can LLMs perform well across languages? Our new benchmark BUFFET enables fair evaluation of few-shot NLP across languages at scale. Surprisingly, LLMs + in-context learning (incl. ChatGPT) are often outperformed by much smaller fine-tuned LMs 🍽️tinyurl.com/BuffetFS
Recently I gave a lecture about retrieval-augmented LMs like RAG, covering their advantages, an overview of diverse methods, and current limitations & opportunities, based on this position paper. akariasai.github.io/assets/pdf/aka video: shorturl.at/ahmq8 Feedback is welcome :)
Quote
Akari Asai
@AkariAsai
𝗛𝗼𝘄 𝗰𝗮𝗻 𝘄𝗲 𝗯𝘂𝗶𝗹𝗱 𝗺𝗼𝗿𝗲 𝗿𝗲𝗹𝗶𝗮𝗯𝗹𝗲 𝗟𝗠-𝗯𝗮𝘀𝗲𝗱 𝘀𝘆𝘀𝘁𝗲𝗺𝘀? Our new position paper advocates for retrieval-augmented LMs (RALMs) as the next gen. of LMs, exploring the promises, limitations, and a roadmap for wider adoption. arxiv.org/abs/2403.03187 🧵
Our paper got the ACL 2023 Best Video Award (presented at EMNLP)! The video is available at youtu.be/hJbxW0xct2E?si This 5-minute video summarizes the interesting findings on (1) when LLMs hallucinate (and why scaling may not help) and (2) how retrieval-augmented LMs alleviate it.
Quote
Alex Mallen
@alextmallen
Our work "When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories" will appear in #ACL2023!! This is my first NLP conference paper and I'm very happy I got to pursue this project with these amazing people at UW! x.com/AkariAsai/stat…
This is a comprehensive list of the must-read papers on the recent progress of self-supervised NLP models (and the impressive capabilities of LLMs), with great summary slides! I also love the role-playing paper-reading seminar format! (colinraffel.com/blog/role-play)
Quote
Daniel Khashabi
@DanielKhashabi
For my first course at @jhuclsp, I am leading a class on recent developments in "self-supervised models." Here is the list of the papers and slides we cover: self-supervised.cs.jhu.edu Would love to hear Twitter's suggestions for additional exciting developments to discuss!
A powerful retriever + pre-trained generator (e.g., DPR+T5) often relies on spurious cues / generates hallucinations. Our 𝕖𝕧𝕚𝕕𝕖𝕟𝕥𝕚𝕒𝕝𝕚𝕥𝕪-guided generator learns to focus on the right passages when generating and shows large improvements in QA/fact verification/dialogue👇
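The intuition behind evidentiality-guided generation is that retrieved passages are scored for whether they actually support an answer, and non-supporting passages are down-weighted or dropped before generation. A hypothetical sketch: the real work trains an evidentiality classifier jointly with the generator, whereas `score_evidentiality` below is just a term-overlap stand-in, and the threshold is an invented parameter.

```python
def score_evidentiality(passage: str, question: str) -> float:
    """Stand-in evidentiality score: fraction of question terms in the passage.

    The actual method uses a learned classifier, not lexical overlap.
    """
    q_terms = set(question.lower().split())
    p_terms = set(passage.lower().split())
    return len(q_terms & p_terms) / len(q_terms) if q_terms else 0.0

def filter_passages(passages: list[str], question: str,
                    threshold: float = 0.3) -> list[str]:
    """Keep only passages judged likely to support the answer."""
    return [p for p in passages if score_evidentiality(p, question) >= threshold]
```

Conditioning the generator only on passages that pass the evidentiality check is what reduces reliance on spurious, topically similar but non-supporting text.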
Our #ICLR2020 camera-ready version, code, and blog are now available! paper: arxiv.org/abs/1911.10470 code: github.com/AkariAsai/lear blog: blog.einstein.ai/learning-to-re You can train, evaluate, and run an interactive demo on your machine. We also release the models for reproducibility.
Quote
Akari Asai
@AkariAsai
New work with Kazuma Hashimoto, @HannaHajishirzi, @RichardSocher, and @CaimingXiong at @SFResearch and @uwnlp! Our trainable graph-based retriever-reader framework for open-domain QA advances state of the art on HotpotQA, SQuAD Open, Natural Questions Open. 👇1/7