florio.dev

EP09 - The Thinking Process of LLMs. With Sara Marjanovic

On other platforms: Web, Apple Podcast, YouTube.


I had the chance to chat with Sara Marjanovic, PhD student at University of Copenhagen, about the thinking process of LLMs.

Deepseek R1 has been the first open model with a visible thinking trace, and this opened the doors to new ways to evaluate and research LLMs. It made possible to benchmark thinking vs non-thinking models, compare different reasoning processes, look at traces to see what the reasoning process looks like, and find potential flaws or research direction to improve the effectiveness, as well as see how it influences the behaviour of the model.

What's interesting about looking at the thinking process? Few things stood out to me from the conversation with Sara:

Interesting? Then go ahead and listen to the episode for the full chat. And if you really want to dig deeper, have a look at the paper "DeepSeek-R1 Thoughtology: Let’s think about LLM reasoning".

That's it, see you at the next episode!

#AI #LLM #research #targzpodcast #thinking