Language Models

April 1, 2025

DeepSeek: The Quiet Math Tweak That Might Redefine the AI Future

Artificial Intelligence, Research

It didn’t arrive with fireworks or fanfare. DeepSeek slipped into the scene with a simple mathematical change—and suddenly, it was the name on everyone’s lips. But what’s really behind the hype? Let’s unpack how a humble paper sparked global buzz, and what it means for the future of AI, geopolitics, and the race for smarter,…

Written by

Dulan Dias
March 30, 2025

How Do Language Models Learn Facts? Inside the Mysterious Memory of AI

Artificial Intelligence, Research

How do large language models actually learn facts? A new study by Google DeepMind and ETH Zürich uncovers a surprising three-phase process—from slow starts to sudden insights and unexpected memory loss. These findings reveal why your AI assistant sometimes nails the answer—and sometimes confidently makes things up.

Written by

Dulan Dias
February 28, 2025

Understanding Large Language Models (LLMs): The Basics of Their Math, Training, and Inference

Artificial Intelligence

Large Language Models (LLMs) have transformed the world of artificial intelligence, enabling machines to generate human-like text, answer questions, and even write code. But how do they actually work? This article breaks down the key concepts behind LLMs in a way that is easy to understand, with just enough math to show how things come…

Written by

Dulan Dias