
AI Updates
StreamYard:
News:
Sutskever Out: OpenAI cofounder and chief scientist Ilya Sutskever has officially departed OpenAI following the Sam Altman coup. https://twitter.com/ilyasut/status/1790517455628198322 https://mashable.com/article/openair-ilya-slutskever-leaves-chief-scientist
Create a local version of your AI Chatbot. Create an OpenAI-like AI assistant with Llama-3 deployed locally on your computer (100% free and without internet access). https://www.reddit.com/r/LocalLLaMA/comments/1cpxhye/create_openai_like_ai_assistant_with_llama3/
NYT Lawsuit: NYT has spent $1M so far in its lawsuit against OpenAI. https://nytco-assets.nytimes.com/2024/05/Q124-Earnings-Release-Final-For-Distribution-rhHXIYbe.pdf
Using AI: The three most common activities completed on ChatGPT include exploring new topics, researching products, and finding recipes. https://twitter.com/TurnerNovak/status/1789847620740890806
Claude expands: Claude is now available for people and businesses across Europe. https://www.anthropic.com/news/claude-europe
Where will we look? Search engine volume will drop 25% by 2026 due to AI chatbots and other virtual agents, according to Gartner. https://www.gartner.com/en/newsroom/press-releases/2024-02-19-gartner-predicts-search-engine-volume-will-drop-25-percent-by-2026-due-to-ai-chatbots-and-other-virtual-agents
Chatter by Hume: Interactive podcast experience https://chatter.hume.ai/
ChatGPT 4o
Hello GPT-4o https://openai.com/index/hello-gpt-4o/
Introducing GPT-4o and more tools to ChatGPT for free users https://openai.com/index/gpt-4o-and-more-tools-to-chatgpt-free/
More Multimodal: Talk and See
Be my eyes
Expand learning and using
Voice Mode
Feels like chatting with a real human
It captures your tone, language, and expressions in real-time.
Many are describing it as a real-life Her
Desktop App (MAC only for now)
Consider these possibilities:
Upload a PowerPoint and let ChatGPT-4o suggest layout tweaks, rephrase slide titles, and improve the design.
Use ChatGPT-4o to inspect a spreadsheet and highlight trends, anomalies, or discrepancies. Or for tech support.
GPT-4o can guide customers through visual step-by-step instructions for installing or setting up products.
Other updates not in the demo:
For developers, GPT-4o is half the price, twice as fast as GPT-4-turbo, and has 5x rate limits.
Way better at writing text correctly in DALL-E 3 images.
It can create fonts.
It can generate 3D visualizations.
Gemini (Google I/O Conference)
100 Announcements https://blog.google/technology/ai/google-io-2024-100-announcements/
Will GAI Search Kill SEO: https://blog.google/products/search/generative-ai-google-search-may-2024/
AI for all your workspaces: This means paying users will have a ChatGPT-esque assistant right beside your screen that knows everything your Google apps know about you
Powered by Gemini1.5 Pro, it now boasts a super-duper big context window of 2M tokens—meaning it can "remember" 1.4M words. https://one.google.com/about/ai-premium/
Working in Docs/Sheets/Gmail/Slides, you can ask Gemini to retrieve or summarize any content from all these apps https://workspace.google.com/solutions/ai/
“Hey Gemini, fetch the latest budget numbers from Sheets and pop them into this email.”
“Hey Gemini, condense the main points from the email chain with our marketing team into a new Doc.”
The Agents Are Coming: Google is prepping basic versions of AI agents that can perform tasks. https://twitter.com/chiefaioffice/status/1790484696557596866
An agent to categorize all receipts in your inbox into a GSheet
An email agent to return an order and schedule a UPS pickup
An agent to manage tasks like updating addresses across multiple websites
AI Teammate: a coworker popping up in chat groups, emails, and documents. If it knows the answer to a question, it’ll answer just like any other employee would. https://mashable.com/article/google-io-2024-ai-teammate
Project Astra: Google’s ultimate AI assistant that can see and reason about what's around you. It’s similar to ChatGPT-4o and might power a future pair of glasses. Watch the demo above. https://www.youtube.com/watch?v=nXVvvRhiGjI&t=16s
Veo: Google’s new AI video generator. to try and keep up with Sore. While it is not as good, there are people with access. https://www.youtube.com/watch?v=diqmZs1aD1g
Imagen 3: Google’s newest text-to-image model. Genuinely impressed by this one. The people look very, very real (see here). https://twitter.com/GoogleDeepMind/status/1790434750592643331
Music AI Sandbox: Google’s take on creating music using AI. Listen to the demos here. https://twitter.com/GoogleDeepMind/status/1790435413682975043
New Models:
Gemini 1.5 Flash—a smaller, speedier version of Gemini 1.5 Pro.
Gemma 2—Google’s best open-source models.
PaLI-3—a fresh open-source vision model.
Gemini Live—a talking feature for Gemini similar to ChatGPT Voice Mode coming later this year.
Tools to explore
Brilliant: offers bite-sized AI lessons so you stay competitive at work https://brilliant.org/
Otto: an AI biographer that records your memories and transforms them into published stories! https://www.landing.ottowrites.co/
Smartazor: Edit YouTube videos in half the time https://smartrazor.ai/
Creator Tools
Claude: A conversational AI platform designed for nuanced and contextually aware interactions, aiming to provide human-like conversation experiences. https://claude.ai/chats
ChatGPT: A platform offering access to OpenAI's GPT model, tailored for engaging in conversational responses, providing information, and generating text-based content. https://chatgpt.com/
Pika Labs: A creative platform focused on AI-driven art creation, allowing users to explore and create digital artworks with the assistance of artificial intelligence. https://pika.art/
Runway: A creative toolkit powered by AI, enabling users to apply machine learning models to video, image, and text projects for innovative content creation. https://runwayml.com/
Leonardo: An AI platform (not widely known as of my last update, so this description is speculative) likely aimed at enhancing digital art creation or providing AI-based tools for creative processes. https://leonardo.ai/
Storyblocks: A stock media platform offering royalty-free videos, images, and audio clips for content creators to use in their projects. https://www.storyblocks.com/
Dalle-3 on Bing: Microsoft’s integration of OpenAI's DALL-E model into Bing allows users to generate unique images based on textual descriptions through a web interface. https://www.bing.com/images/create
Ideogram: An AI-driven platform focused on generating ideograms or visual symbols that represent ideas or concepts, facilitating creative visual communication. https://ideogram.ai/t/explore
MusicFX: A Google experiment that allows users to explore the creation of music using AI, part of Google's AI Test Kitchen, focusing on experimental AI applications in music. https://aitestkitchen.withgoogle.com
Mubert: An AI-powered platform for generating unique music streams, allowing creators to produce music through algorithmic composition. https://mubert.com/
Suno: An AI tool focused on enhancing voice communication by offering real-time, AI-driven voice analysis and improvement features (based on the description, this is speculative). https://www.suno.ai/
ElevenLabs: Offers AI technology for creating realistic voice synthesis and cloning, enabling high-quality voice generation for various applications. https://elevenlabs.io/
Adobe Speech Enhancer: A tool designed to improve the quality of audio recordings, especially in podcasting, by removing noise and enhancing speech clarity. https://podcast.adobe.com/enhance
Timebolt: A video editing software that automates the process of cutting out silences from video content, making content creation more efficient. https://www.timebolt.io
Descript: A multi-tool platform for audio and video editing that features transcription, text-to-speech, and media editing capabilities, aimed at content creators. https://www.descript.com/
HeyGen: Provides AI-driven tools for generating written content, aiming to assist in the creative writing process with the help of artificial intelligence. https://heygen.com/
Opus Clip: A platform (as of my last update, not widely recognized, so description is speculative) likely focused on video editing or creation tools enhanced by AI. https://www.opus.pro/
TubeBuddy: A browser extension and mobile app designed to help YouTube creators optimize their videos, manage their channels, and grow their audience. https://futuretools.link/tubebuddy-com
Invideo: An online video creation platform offering tools and templates for creating professional-quality videos for marketing, social media, and more. https://invideo.io/
LTX Studio: Specializes in leveraging AI for text-to-video generation, allowing users to create video content from written narratives. https://ltx.studio/