Issues / ISSUE #2405

From Stable Diffusion 3.5 to GitHub Spark: A Week of Groundbreaking AI Tools and Open-Source Innovations

Discover this week's AI revolution! From Stability AI's game-changing release to OpenAI's enhanced search capabilities and GitHub's natural language app builder, we're witnessing a transformation in how we interact and create using AI.

From Stable Diffusion 3.5 to GitHub Spark: A Week of Groundbreaking AI Tools and Open-Source Innovations

Jump to:

Top 10 AI News #weekly

Stability AI  introduced Stable Diffusion 3.5. This open release includes multiple model variants, including Stable Diffusion 3.5 Large and Stable Diffusion 3.5 Large Turbo, and as of October 29th, Stable Diffusion 3.5 Medium. 
These models are highly customizable for their size, run on consumer hardware, and are free for both commercial and non-commercial use under the permissive Stability AI Community License. 
You can download all Stable Diffusion 3.5 models from Hugging Face and the inference code on GitHub now.

Further Reading

OpenAI has released ChatGPT Search, which allows you to quickly make it search the web for your queries.

You just need to simply click the 🌐 icon.

Rolling out now to Paid users and it will roll out to Free Users in the coming months.

It also has a chrome extension: ChatGPT Search

Further Reading

SynthID watermarks and identifies AI-generated content by embedding digital watermarks directly into AI-generated images, audio, text or video, which is impossible for you to see in an image but easy for the detection tool to spot. Google’s ready and willing for it to get tested and broken.

Further Reading

Spark, which is officially an experiment the company is launching out of its GitHub Next labs, allows you to quickly build a small web app using nothing but natural language. Experienced developers can still see and edit the code and underneath it all is a GitHub repository, GitHub Actions, and Microsoft’s Azure CosmosDB as the default database for applications that need one  but that’s optional.

Ideally, you’ll be able to use a chat-like experience to create a prototype and then refine it in subsequent steps.

It needs a waitlist. Apply here:Apply

Further Reading

 1) Meet ClaudeDev: an open source autonomous AI programmer in VScode. 

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

 2) Claude is now available on @GitHub Copilot.

Starting today, developers can select Claude 3.5 Sonnet in Visual Studio Code and GitHub.com. Access will roll out to all Copilot Chat users and organizations over the coming weeks. 

Further Reading on anthropic

3)The Claude app is now available to download on Mac and Windows: claude.ai/download.

You can now dictate messages to Claude on our iPhone, iPad, and Android apps.

Further Reading

1) Google Learn About experiment

Google has released a new product called Google Learn About.

You can deep dive into almost any topic and use Google’s suggestions.

There you can prompt any topic and deep dive into it through Google's autosuggestions. It also can use search to search for information.

 2) Gemini API and Google AI Studio now offer Search Grounding.

Search Grounding enables users to get more accurate and fresh responses from the Gemini models aided by Google Search.

google search  grounding

Say hello to Grounding with Google Search, available in the Gemini API + Google AI Studio!

You can now access real time, fresh, up to date information from Google Search when building with Gemini by enabling the Grounding tool.

Further Reading

A mysterious new image generation model is beating models from Midjourney, Black Forest Labs, and OpenAI on the crowdsourced Artificial Analysis benchmark.

The model, which goes by the name “red_panda,” is around 40 Elo points ahead of the next-best-ranking model, Black Forest Labs’ Flux1.1 Pro, on Artificial Analysis’ text-to-image leaderboard. Artificial Analysis uses Elo, a ranking system originally developed to calculate the relative skill level of chess players, to compare the performance of the various models it tests.

The company behind the mysterious Red_Panda text-to-image model is Recraft AI (model name: Recraft V3) and you can try it now on their platform and in Replicate.

Further Reading

π0 is:

- a 3B pre-trained generalist model trained on 8+ robot platforms

- a post-training recipe that allows robots to do dexterous, long-horizon tasks

Further Reading

Microsoft AI CEO Mustafa Suleyman told LinkedIn co-founder Reid Hoffman on a podcast that startups can find a niche in fine-tuning AI models with accurate examples.

Further Reading

Meet Synthflow Voice 2.0. Enjoy the new @OpenAI Realtime Voice Integration and a bunch of new features 👇🏻

- New Customisable Widget with Actions

- ML Based Voicemail Detection

- Realistic Background Noise

- Warm Call Transfers

- @cartesia_ai Voices

- and more!

Further Reading

Best Article(s) #weekly

How to build an AI search engine from scratch!

How to built TurboSeek (19k users), an OSS Perplexity clone, with Next.js + Together AI.

Read More

Anthropic published a repo with courses on how to use LLMs

Anthropic published a repo with courses on how to use LLMs.

Read More

Best Open Source Alternatives to Proprietary Software#weekly

Screenshot to Code

Screenshot to Code

Screenshot to Code can change visual designs into code for developlers that supports various frameworks, including HTML + Tailwind, React, and Vue.

Read More

Auto Scraper

Auto Scraper

AutoScraper is an automatic, light-weight Python solution for scraping web pages with a URL and some sample data.

Read More

This Week's Summary

This week showcases remarkable developments in AI.

Key highlights include Stability AI's release of Stable Diffusion 3.5 with multiple model variants for commercial use, OpenAI's launch of ChatGPT Search for enhanced web queries, and GitHub's innovative Spark platform for building web apps using natural language. We've seen significant advances in AI tooling with Google's SynthID watermarking technology and Claude's expansion to GitHub Copilot and desktop applications.

The open-source community continues to thrive with projects like Screenshot to Code and AutoScraper, while Google strengthens its AI ecosystem with Learn About experiments and Gemini API's Search Grounding. The emergence of RedPanda's mysterious image generation model and π0's robotic capabilities round out a week filled with technological breakthroughs and practical AI applications.

Our Rating System

To maintain objectivity and fairness, our news or project selection has not been influenced by any advertisers.