Welcome to the 387 new subscribers who joined in the last week.
โ
Poll results show many of you want more emails from me to keep up to date on the latest developments. From now on I'll default to twice a week, but I'll only do once a week if news are slow, to avoid spamming you.
โ
Whatโs brewing in AI this week:
- OpenAI co-founder Andrej Karpathy quits
- Poll: ChatGPT vs Gemini
- A couple of cool GPTs
- The other top stories in AI this week
โ
Darioโs Picks:
I. Andrej Karpathy quits OpenAI
โ
โ
OpenAIโs founding member Andrej Karpathy (previously AI director at Tesla) announced on X that he already left the company, with a plan to work on personal projects. He emphasised that no issues or drama led to his departure.
Why it matters: Andrej was a respected research scientist at OpenAI, and widely considered one of the brightest people in the world of AI. He says there were no issue leading to him quitting, but itโs worth noting that chief scientist Ilya Sutskeverโs currently has an unclear status at the company, with some sources saying he has become invisible. Wouldnโt call it a conspiracy to suggest that there seems to some tensions or turbulence in the research division of OpenAI.
โ
II. OpenAI is testing cross-chat memory in ChatGPT
โ
โ
OpenAI is testing cross-chat memory in ChatGPT for a small group of users, allowing it to remember details across all conversations. As you chat, it starts to pick up details by itself, and will get better over time. Users can edit or clear the memory, similar to browser history, and also interact with this feature conversationally ("remember this" or "forget that").
Why it matters: The idea of ChatGPT remembering stuff about you to make it more useful is alluring. Iโm a bit sceptical about how useful this feature will be in practice though. Iโve found that leaving details about me in custom instructions comes in handy only in very specific cases, mostly in terms of instructions/formatting of the output. Iโve also experienced certain details Iโve entered in custom instructions as counterproductive, because it evokes the memory when itโs not relevant or youโre looking to explore something new.
While I definitely believe weโll have AIs that remember every detail about us soon enough, Iโm curious if this new feature will be able to identify the right information to save, and being able to use it without over relying on it.
โ
III. OpenAI is reportedly developing AI agents that can do things without a prompt.
OpenAI is said to be working on new products that not only generate text, but can actually automate your workflow. Things like transferring data from a document to a spreadsheet, booking flights and hotels, e-mail a group of people. This could potentially automate stuff you do, particularly web-based tasks, that normally requires you to click around, move the cursor, type text, etc.
Why it matters: AI agents that can handle workflow all by themselves could really change how we work with AI. Imagine if you were able to not only generate a travel itinerary with AI, but actually have it book your next vacation based on your criteria. With this type of agents, OpenAI is delving into similar territory as Rabbit and Hypewrite, focusing on AIs that can take action in the real world.
โ
IV. Nvidiaโs Chat with RTX
Nvidiaโs new demo app, Chat with RTX, makes it easy to run an LLM (Mistral or Llama 2) locally on your PC that can dig through docs, notes, videos, or other data. Itโs faster and more secure than doing it in the cloud.
Note. You need Windows and a RTX 30- or 40-series GPU with at least 8GB of VRAM in order to run it.
Why it matters: This seems like a simple but useful idea. Havenโt tested it myself as I use iOS, but as the Verge pointed out, it can be a valuable tool for researchers and journalists needing to securely and rapidly analyse a collection of documents. It can also handle YouTube videos, but this is less impressive to me as it isnโt generally a privacy-demanding task, and thereโs already a range of GPTs that can readily do the same.
โ
In Focus
Most of you are ChatGPT plus users, but I reckon quite a few of you have already tried Gemini Ultra as well. Letโs compare Gemini Ultra to GPT-4.
โ
*SEEย THEย EMAILย VERSIONย OFย THISย NEWSLETTERย TOย VOTE*
โ
My experience so far: Gemini Ultra is faster and has no usage cap, with Google services integrations as an advantage (although I havenโt personally found much use in it yet). However, it often struggles with simple tasks, forgets chat context faster than GPT-4, and generates inferior images compared to DALL-E and Midjourney. GPT-4 is still my go-to.
Gemini Ultra isnโt yet on Huggin Faceโs LMSys leaderboard, but its little brother Gemini Pro ranks 3rd, behind GPT-4 Turbo models. It will be interesting to see where it Ultra ranks when it eventually appears.
โ
GPTs
Editorโs Choice
Weekly picks from me to you
โ
AutoGPT Agent
๐ฌ10K+
Plans, researches, strategizes, and works to complete tasks semi-autonomously using multi-modal tools, enabling completion of tasks with minimal user input.
โ
โ
โ The name is a bit exaggerated โ it wonโt go and automate your whole workflow โ but itโs still a great GPT. Iโve tested it versus default GPT-4 on things like sentiment analysis of stocks and some market research tasks. It did a notably better job at giving a structured, thoroughly researched answer.
โ
Featured GPT (Sponsored)
RPG Game Creator
๐ฌ400+
Crafts an RPG from your prompt and generates a link to play the game.
โ
โ
โ It generates an 8-bit Final Fantasy style game based on your prompt. You can also make sequels to your games based on existing games. It didnโt completely adhere to my prompt (doesnโt look futuristic, and no robots), but I give it two thumbs up for innovation.
โ
Bytes
General
- Eleven Labsโ new voice actor payout program: Turn your voice into passive income in 30 minutes.
- Microsoft Copilot has a new look and improved image generation features.
- WSJ did an article on early adopters expressing disappointment with Microsoft Copilot, questioning how much value it adds. Transcribing meetings on Teams is the most useful feature.
- AI was in the spotlight during the Super Bowl last week, with ads for Microsoft Copilot among others. Check out the AI-related ads here.
Legislation
- US Government restricts calls using AI voices. The law gives state attorneys general power to prosecute robocallers using AI voices, regardless of whether the call is deceptive or annoying. The exception is if the recipient of the call has given prior consent or in the case of an emergency.
Research
- Interesting, and a little unsettling: ยผ Americans are using AI to create content and/or photos when dating online. 39% of guys using AI to help them write a Valentines message. 69% says AI content gave them better responses.
- A recent study found that ChatGPT demonstrates higher rationality scores than humans in decision making, including when contemplating two options. GPT could have the potential in assisting human decision-making.
- While most U.S. adults say AI's risks outweigh its benefits, Gen-Z (particularly college students and fresh graduates) think it can give them a career edge.
โ