Hello Reader,

Promptcraft is a curated newsletter on AI for education designed to elevate your AI literacy.

In this issue:

  • OpenAI releases GPT-4o, the most advanced model, for free
  • ​Google introduces LearnLM, a family of AI models for learning
  • Gemini Comes to Workspace for Education

Let’s get started!

~ Tom Barrett

7uLEiCQ1A1dgu9f2QMrfzy

ACCESS

.: OpenAI releases GPT-4o, the most advanced model, for free

Summary ➜ OpenAI has released GPT-4o, a new, free AI model that surpasses GPT-4 by a significant margin. This model is groundbreaking as it offers multimodal capabilities, including text, audio, video, and image processing. By making GPT-4o free, OpenAI has disrupted the AI landscape and set a new standard for competitors.

Why this matters for education ➜ I decided to curate this as the lead story because it concerns access to the most powerful AI capabilities.

There has always been a big difference between the GPT-3.5 model and GPT-4, the latter being part of a paid subscription. With this update, any educator or student with a free account can explore the most powerful model.

When using GPT-4o, ChatGPT Free users will also get access to features such as:

  • Experience GPT-4 level intelligence
  • Get responses from both the model and the web
  • Analyse data and create charts
  • Chat about photos you take
  • Upload files for summarising, writing or analysing
  • Discover and use GPTs and the GPT Store
  • Build a more helpful experience with Memory

Of course we need safe connections to these powerful tools and we should question what the trade-off might be in this scenario. But it is exciting to think every teacher with a connected device across the globe can access this powerful and flexible tool.

Read more of my reflections below about the OpenAI announcement and the updates to Voice mode.

uLw14vy5k9rAK8oAKAoezE

EDU

.: Google introduces LearnLM, a family of AI models for learning

Summary ➜ Google is introducing LearnLM, a new family of generative AI models designed to improve learning and teaching experiences. LearnLM is integrated into various Google products.

Why this matters for education ➜ The initial approach is to integrate the fine-tuned model, which is based on Gemini, into YouTube, Search, and Classroom to provide features like summarising videos, brainstorming ideas, generating quizzes, and offering personalised help. Although those use cases are pretty flimsy, they are a start, and some better examples were shared in the announcement. Two experiments were announced within Labs, Illuminate – turning research papers into AI-generated podcast audio and Learn About, which explores how information can turn into understanding by bringing together high-quality content, learning science and chat experiences. Despite the limited access and uncertainty of whether they will ever turn into full-blown products, they are an interesting glimpse into the future of learning tools with AI and Google.

oP3UzhYRXLkVQh2RetkgK9

TOOLS

.: Gemini Comes to Workspace for Education

Summary ➜ Google has integrated Gemini, its generative AI, into its Education suite within Google Workspace. The Gemini Education base plan, available for educators and students over 18, enhances Docs, Sheets, Slides, and Meet with AI capabilities and chat functionality.

Why this matters for education ➜ When Google makes a move, it has a greater impact on education than other companies due to the extensive use of their tools. However, I think this is diluted here because the cost of Gemini for Education starkly contrasts the freely available best-in-class model GPT-4o from OpenAI. Pricing for the Gemini Education base plan is $24/month or $192/year per user, while the premium plan is $36/month or $288/year per user. Integrating AI into tools like Docs provides a different experience of generative AI than chatbots; this integration is central to the paid plans. My experience with these Google Workspace AI integrations has been patchy, and much improvement is needed.

.: Other News In Brief

🤖 Google strikes back at OpenAI with “Project Astra” AI agent prototype

📁 ChatGPT now lets you import files directly from Google Drive, Microsoft OneDrive

🌍 Microsoft’s carbon emissions up nearly 30% thanks to AI

📸 Instagram co-founder joins Anthropic as chief product officer in fight against OpenAI

🎥 Google unveils Veo, a high-definition AI video generator that may rival Sora

🇺🇸 US Senators unveil proposed $32 billion roadmap for regulating AI

🎵 Sony Music warns tech companies over ‘unauthorised’ use of its content to train AI

💰 Hugging Face is sharing $10 million worth of compute to help beat the big AI companies

📱6 Things Announced at Google I/O, One That Mattered A Lot, and One Mystery

🖼️ Google still hasn’t fixed Gemini’s biased image generator

:. .:

What’s on my mind?

.: Closer Synthetic Conversation

This week has been filled with exciting tutoring demonstrations and educational AI announcements from OpenAI and Google, with more likely to come from Microsoft and Apple soon. What’s been on my mind, though, is a particular improvement in the responsiveness of voice models demonstrated by OpenAI’s new Voice mode with GPT-4o and Google’s Gemini Live demo in Kings Cross (which also featured some intriguing glasses!).

video preview

While shaving milliseconds off response times might not seem like much, the demonstrations we’ve seen, especially from OpenAI, show that AI systems are integrating more fluidly into human dialogue patterns. This is a significant change. Reduced latency—the processing or wait time—means we are more engaged. We might stick around in voice chats longer and feel that these systems are more helpful.

There was even a deliberate demonstration of interrupting the AI system as it spoke, highlighting this improvement in user experience. Both OpenAI and Google showcased how their systems will utilise live vision through the camera, tapping into spatial data as a rich source of new information for training once research labs have exhausted other data sources.

If you haven’t experienced using the frontier AI systems in voice chats, I highly recommend giving it a try. It is the most fluid and natural way to interact, and it seems that the vision of these leading labs places voice control at the forefront.

Putting aside the technical advancements for a moment, let’s consider what this means for us as humans. The proximity to human dialogue has changed in these demonstrations. The speed of interaction is crucial, but you’ll also notice the intonation, the personality, and the informality in the communication style. Remember, all of this is designed, intentional, and deliberate.

I’m curious to know what it feels like to experience synthetic conversational fluency. It’s already quite fascinating, and I’m intrigued by how millisecond changes can profoundly affect our experience of synthetic digital relationships.

~ Tom

Prompts

.: Refine your promptcraft

This week I am featuring one of the recommended prompts shared by Anthropic, the research lab behind the Claude family of AI models.

Second-grade simplifier

Make complex text easy for young learners to understand.

You could use the following System prompt in your own GPT, Gem or Bot. Or use both within a normal chatbot session.

System Prompt

Your task is to take the text provided and rewrite it in a way that is easy for young learners in grades 3-5 to read and understand. Simplify advanced vocabulary, break down long sentences, explain difficult concepts in plain language, and present the information in a clear, engaging way. The short rewritten text should convey the core ideas of the original text in an age-appropriate manner.

Here is some example text from the technical research paper linked below about LearnLM.

In our current work, models 𝑀0–𝑀4 are fine-tuned via SFT over all parameters of a base model
(PaLM 2.0 [109] for 𝑀0–𝑀3 and Gemini 1.0 [10] for 𝑀4 of comparable size; see Section E for further
implementation details). While reward modeling and RL are crucial (and in our opinion the most
promising) ingredients to building high-quality gen AI tutors, we have thus far focused only on SFT
(and the requisite creation of behaviour cloning data). Of course, this puts our models at a serious
disadvantage in evaluations against the base models, which include both SFT and (non-pedagogical)
RL, and we plan to incorporate RL in the future (see Section F for a discussion of the challenges that
come with eliciting human preferences to support RL for educational use cases).

And here is the 3 paragraph result using our simplifier prompt with GPT-4o.

Right now, we are working on improving our computer programs, called models. We have five models, M0 to M4. We make them better by teaching them with special training called SFT. Four of them, M0 to M3, start with a basic program called PaLM 2.0, and the fifth one, M4, starts with a different program called Gemini 1.0.
We think using reward modeling and RL (a way of teaching by rewarding good actions) is very important to make these models great teachers. But for now, we are only using SFT. This means our models might not be as good as the original ones because they don’t have the extra RL training.
In the future, we plan to add RL to our models. This will help them learn better by understanding what people like and want.

:. .:

Remember to make this your own, try different language models and evaluate the completions.

Do you have a great prompt you would like me to share in a future Promptcraft issue? Drop me a message by replying to this email.

Learning

.: Boost your AI Literacy

SPATIAL AI

.: With Spatial Intelligence, AI Will Understand the Real World | Fei-Fei Li | TED

video preview

INTRO
.: AI Explained: It’s maths, not magic (a simple guide) from Google and Australia’s CSIRO

video preview

RESEARCH
.: Towards Responsible Development of Generative AI for Education: An Evaluation-Driven Approach

Dig into the research and development of Google’s LearnLM model.

Grounded in educational research and tailored to how people learn, LearnLM represents an effort across Google DeepMind, Google Research and our product teams to help make learning experiences more engaging, personal and useful. Our technical report presents our approach to improving generative AI for education and highlights how we’re working together with the AI and EdTech communities to responsibly maximize its positive impact and potential.

Ethics

.: Provocations for Balance

Scenario 1: The Perfect Listener

OpenAI releases a groundbreaking update to its AI, enabling unprecedented conversational fluency and emotional responsiveness. In classrooms, these AI assistants become the go-to for students, offering instant feedback, personalised tutoring, and empathetic listening. Teachers notice a significant increase in student engagement and academic performance. However, as students grow more attached to these AI companions, they begin to confide in them more than in their human teachers or peers. This deep reliance raises concerns about the erosion of human relationships and the potential for AI to misinterpret or misuse sensitive information. The school board must decide whether to continue using these highly responsive AI assistants, which clearly benefit education, or to limit their use to ensure that human connections and emotional intelligence remain central to students’ development.

Scenario 2: Human Touch

In the midst of a nationwide teacher shortage, a small town’s school district struggles to fill vacant teaching positions. To cope, the district relies heavily on the remaining human teachers, who are overwhelmed and burnt out. Desperate for a solution, the school board introduces advanced AI teaching assistants powered by GPT-4o to support the overworked staff. Initially, the AI assistants help alleviate the workload, allowing teachers to focus on more meaningful interactions with students. However, as the AI takes on more responsibilities, it begins to undermine the authority and expertise of the human teachers. The community must decide whether to continue relying on AI to fill the gap, risking the devaluation of the teaching profession, or to seek alternative solutions that might be less efficient but preserve the integrity and importance of human educators.

Scenario 3: Eye of the Beholder

A leading tech company releases an innovative AI system called “SpatialSense,” designed to enhance learning by utilising spatial data from students’ environments. This AI collects data through cameras and sensors, analyzing everything from classroom layouts to students’ body language and social interactions. It customizes lessons in real-time, tailoring content to each student’s unique spatial and social context. Initially, the results are impressive: engagement and comprehension rates soar. However, as the AI becomes more integrated into the educational system, concerns arise about privacy and the ethical use of such detailed personal data. When a student discovers that the AI has recorded sensitive moments without consent, the school system faces a dilemma.

.: :.

Inspired by some of the topics this week and dialled up.

:. .:

How would you rate this issue of Promptcraft?

(Click on your choice below)

⭐️⭐️⭐️⭐️⭐️ Loved it!

⭐️⭐️⭐️ Good

⭐️ Poor

If you have any other kind, specific and helpful feedback, please reply to this email or contact me at tom@dialogiclearning.com

.: :.

The more we invest in our understanding of AI, the more powerful and effective our education ecosystem becomes. Thanks for being part of our growing community!


.: Tom Barrett