Foundation Model Frontiers

Sommersemester 2025
Hinrich Schütze, Shengqiang Zhang
Fr 10:15-11:45

Room

Room 057, Oettingenstr 67

Topic

Foundation models have been for the last few years and continue to be a highly dynamic research area -- in terms of scientific progress, technical innovation and real-world impact. In this seminar, we will review and discuss the latest developments in foundation models, including new breakthroughs as they happen.

Credit for MSc Computerlinguistik

To get credit for this class, you must give a presentation AND write a thesis (Hausarbeit).
Topics of presentation and thesis do not have to be on exactly the same topic.
Length of presentation: 30 minutes + 15 minutes Q&A/discussion
Evaluation of presentation and thesis: pass/fail
Pruefungsordnung for MSc Computational Linguistics

Schedule

day	topic	resources	details

Apr 25	introduction		organization, lectures, student topics

May 2	no class

May 9	synthetic data		talk by Latif Köksal, DeepMind

May 16	discussion of topics
	scheduling

May 23	memory etc.	NoLiMa	talk by Ali Modaresi

May 30	refusal		paper

June 6	ARC challenge	pdf
	key-value memory	pdf

Jun 13	AI co-scientist	pdf
Jun 13	Gemma 3	pdf

Jun 20	latent reasoning	pdf
	in-context reasoning explained	pdf

Jun 27	synthetic data	pdf
	humanity's last exam	pdf

Jul 4

Jul 11

Jul 18	multilingual reasoning	Mingyang Wang	paper

Jul 25	formal methods for LLMs	William Merrill, NYU

Topics for Referat and Hausarbeit

Topics and papers given for each topic are (somewhat random) examples. Feel free to propose your own topics and papers for your Referat/Hausarbeit.

arch = architectures, including agentic systems and human-agent collaboration

reas = reasoning

interp = interpretability

tech = technical report

eval = evaluation

synth = synthetic data

misc = miscellaneous

paper topic

all topics covered in the lectures (see above)

reas Geiping et al. (2025) test-time compute: recurrent depth approach

reas DeepSeek-AI (2025) DeepSeek-R1: reasoning through RL

tech Gemma Team (2025) Gemma 3 technical report

interp Olsson et al. (2022) induction heads

synth Maini et al. (2024) rephrasing the web

interp Sharkey et al. (2025) open problems in mechanistic interpretability

interp Park et al. (2024) linear representation hypothesis

interp Han et al. (2024) word embeddings are steers

tech Llama Team (2024) Llama 4

tech Llama Team (2024) Llama 3

tech Xu et al. (2025) Qwen 2.5-Omni

tech Bai et al. (2025) Qwen 2.5-VL

tech Qwen et al. (2024) Qwen 2.5

tech Abdin et al. (2024) Phi-4

interp Makelov et al. (2024) sparse autoencoders (2)

interp McDougall et al. (2023) copy suppression

interp Saphra et al. (2024) notion of mechinterp

reas Dutta et al. (2024) mechinterp: COT

interp Geva et al. (2023) factual associations/enrichment

interp nostalgebrist (2020) logit lens

interp Chughtai et al. (2024) summing up the facts

reas Shao et al. (2024) DeepSeekMath

reas Zhao et al. (2024) Marco-o1

arch Wu et al. (2024) REFT

reas Hübotter et al. (2025) SIFT

misc Hughes et al. (2024) open-endedness

reas Turpin et al. (2023) unfaithful COT

arch Gottweis et al. (2025) AI co-scientist

reas Venhoff et al. (2025) steered reasoning

interp Yu et al. (2024) superweights

interp Bricken et al. (2023) sparse autoencoders (1)

arch Packer et al. (2023) MemGPT

misc Milliere et al. (2024) philosophy of LLMs

arch StanfordNLP (2024) DSPy

interp Durrani et al. (2020) analyzing neurons

interp Voita et al. (2023) dead neurons

arch De Peuter et al. (2023) human-agent cooperation

tech Ustun et al. (2024) Aya

tech Google NotebookLM

tech Groeneveld et al. (2024) Olmo

interp Dai et al. (2022) knowledge neurons

interp Elhelo et al. (2024) head functionality

interp Ferrando et al. (2024) information flow routes

misc Nancy Yu (2024) llm censorship

interp Lad et al. (2024) stages of inference

interp Geva et al. (2021) llms as key value memories

interp Wendler et al. (2024) multilingual representations

tech Anthropic (2025) PR Claude 3.7 Sonnet

tech DeepSeek-AI (2025) DeepSeek V3

interp Lindsey et al. (2025) biology of LLMs

eval Phan et al. (2025) humanity's last exam

tech Team Cohere (2025) Command A

misc Asai et al. (2023) Self-RAG

misc Xu et al. (2024) ActiveRAG

misc Jeong et al. (2024) Adaptive-RAG

	paper	topic
		all topics covered in the lectures (see above)
reas	Geiping et al. (2025)	test-time compute: recurrent depth approach
reas	DeepSeek-AI (2025)	DeepSeek-R1: reasoning through RL
tech	Gemma Team (2025)	Gemma 3 technical report
interp	Olsson et al. (2022)	induction heads
synth	Maini et al. (2024)	rephrasing the web
interp	Sharkey et al. (2025)	open problems in mechanistic interpretability
interp	Park et al. (2024)	linear representation hypothesis
interp	Han et al. (2024)	word embeddings are steers
tech	Llama Team (2024)	Llama 4
tech	Llama Team (2024)	Llama 3
tech	Xu et al. (2025)	Qwen 2.5-Omni
tech	Bai et al. (2025)	Qwen 2.5-VL
tech	Qwen et al. (2024)	Qwen 2.5
tech	Abdin et al. (2024)	Phi-4
interp	Makelov et al. (2024)	sparse autoencoders (2)
interp	McDougall et al. (2023)	copy suppression
interp	Saphra et al. (2024)	notion of mechinterp
reas	Dutta et al. (2024)	mechinterp: COT
interp	Geva et al. (2023)	factual associations/enrichment
interp	nostalgebrist (2020)	logit lens
interp	Chughtai et al. (2024)	summing up the facts
reas	Shao et al. (2024)	DeepSeekMath
reas	Zhao et al. (2024)	Marco-o1
arch	Wu et al. (2024)	REFT
reas	Hübotter et al. (2025)	SIFT
misc	Hughes et al. (2024)	open-endedness
reas	Turpin et al. (2023)	unfaithful COT
arch	Gottweis et al. (2025)	AI co-scientist
reas	Venhoff et al. (2025)	steered reasoning
interp	Yu et al. (2024)	superweights
interp	Bricken et al. (2023)	sparse autoencoders (1)
arch	Packer et al. (2023)	MemGPT
misc	Milliere et al. (2024)	philosophy of LLMs
arch	StanfordNLP (2024)	DSPy
interp	Durrani et al. (2020)	analyzing neurons
interp	Voita et al. (2023)	dead neurons
arch	De Peuter et al. (2023)	human-agent cooperation
tech	Ustun et al. (2024)	Aya
tech	Google	NotebookLM
tech	Groeneveld et al. (2024)	Olmo
interp	Dai et al. (2022)	knowledge neurons
interp	Elhelo et al. (2024)	head functionality
interp	Ferrando et al. (2024)	information flow routes
misc	Nancy Yu (2024)	llm censorship
interp	Lad et al. (2024)	stages of inference
interp	Geva et al. (2021)	llms as key value memories
interp	Wendler et al. (2024)	multilingual representations
tech	Anthropic (2025) PR	Claude 3.7 Sonnet
tech	DeepSeek-AI (2025)	DeepSeek V3
interp	Lindsey et al. (2025)	biology of LLMs
eval	Phan et al. (2025)	humanity's last exam
tech	Team Cohere (2025)	Command A
misc	Asai et al. (2023)	Self-RAG
misc	Xu et al. (2024)	ActiveRAG
misc	Jeong et al. (2024)	Adaptive-RAG