Howdy, wizards.
Hereβs whatβs brewing in AI this Friday.
DARIOβS PICKS
Google just released a new model, Gemini Exp-1121, butβ¦
β¦this needs context:
1) It started by Google releasing an upgraded model last week, Gemini Exp-1114, climbing to the top of the model ranking in the lmarena leaderboard.
2) Apparently, OpenAI wasnβt having it, because yesterday they put out an improved GPT-4o version (particularly better at creative writing) which quickly dethroned Googleβs new model.
3) However β it looks like Google knew what was coming β as they launched yet another new model today, Gemini Exp-1121. It has already climbed to the #1 rank in lmarena.
As a side note, while lmarenaβs leaderboard is by far the most cited one, the alternative (βcontamination-freeβ) benchmark at LiveBench tells a different story. Here, Gemini Exp-1121 ranks below 5 other models including o1-preview/mini and Claude 3.5 Sonnet.
The release timings and lack of detail about all these new models are noteworthy. Some say Google played OpenAI by throwing the first, slightly better model out as bait, and waiting for them to launch something marginally better, only to come back even stronger. But β this is an ongoing story and we donβt know whatβs going to be released in the next few weeks.
β Why it mattersβ β In practice, I think weβll see little difference to our AI applications with each of these updates. I think thereβs a bigger, more comprehensive, and better explained model release coming from both OpenAI and Google in the near future β which will be far more important. For now, itβs mostly entertaining to watch them play leapfrog with each other.
TOGETHER WITH VISIBLE BY VERIZON
Let me guess: you've been putting up with spotty service and those sneaky rate hikes from your current wireless provider for years, right? Well, guess what? You deserve way better than that.
With Visible, you can get unlimited data, talk, text, and hotspot starting at just $275/year. Enjoy dependable 5G and 4G network coverage with no contracts and no hidden feesβ so you can enjoy reliable wireless without the long-term commitment.
Switch to Visible today and experience the freedom of great service at an unbeatable price. Whatβs brewing in AI readers save up to $145 on an annual plan, no code required. But itβs for a limited time so donβt wait to make the switch.
DARIOβS PICKS
Chinese AI research lab DeepSeek launched a powerful reasoning model this week called R1-Lite-Preview. It matches OpenAIβs o1 on certain benchmarks like AIME and MATH β and shows you even more information about its chain of thought process as its thinking. The R1-Lite-Preview model is accessible through DeepSeek Chat, and capped at 50 daily messages.
β Why it mattersβ β Just over two months after OpenAI released o1-preview, a lesser-known lab has developed a competitive alternative; AI is really moving at lightning speed. This might put some extra pressure on OpenAI to release the full o1 version soon.
FROM OUR PARTNERS
Give your inbox a brain. Ceejay is an assistant that finds any email instantly and manages your calendar through natural conversation. Stop searching, just ask: 'What did Sarah say about the budget?'
Download for iPhone today, and reclaim 2-3 hours of your day.
THATβS ALL FOLKS!
Was this email forwarded to you? Sign up here.
Want to get in front of 13,000 AI enthusiasts? Work with me.
This newsletter is written & curated by Dario Chincha.
What's your verdict on today's email?
Affiliate disclosure: To cover the cost of my email software and the time I spend writing this newsletter, I sometimes link to products and other newsletters. Please assume these are affiliate links. If you choose to subscribe to a newsletter or buy a product through any of my links then THANK YOU β it will make it possible for me to continue to do this.



