Issues / ISSUE #2409

Weekly AI Discovery: Generative AI Revolutionizes Sound, Vision, and Beyond

In this weekly’s AI news, NVIDIA's Fugatto model shatters boundaries, empowering users to create captivating soundscapes, transform voices, and produce never-before-heard audio with just text and audio inputs. Meanwhile, Georgia Tech's "Chameleon" AI offers a sophisticated digital mask to shield personal photos from unwanted facial recognition, ushering in a new era of digital privacy. From Amazon's Olympus video analysis AI to the funding raises powering the next generation of AI agents and engagement platforms, the world is witnessing a seismic shift in how we interact with technology. Brace yourself for a future where generative AI redefines the limits of what's possible.

Weekly AI Discovery: Generative AI Revolutionizes Sound, Vision, and Beyond

Jump to:

Top 10 AI News #weekly

Called Fugatto (short for Foundational Generative Audio Transformer Opus 1), it generates or transforms any mix of music, voices and sounds described with prompts using any combination of text and audio files.

For example, it can create a music snippet based on a text prompt, remove or add instruments from an existing song, change the accent or emotion in a voice — even let people produce sounds never heard before.

Further Reading

AI could hold the key to hiding your personal photos from unwanted facial recognition software and fraudsters, all without destroying the image quality.

A new study from Georgia Tech university, published July 19 to the pre-print arXiv database, details how researchers created an AI model called "Chameleon," which can produce a digital "single, personalized privacy protection (P-3) mask" for personal photos that thwarts unwanted facial scanning from detecting a person's face. Chameleon will instead cause facial recognition scanners to recognize the photos as being someone else.

Further Reading

Nov 27 (Reuters) - E-commerce giant Amazon (AMZN.O) has developed new generative artificial intelligence (AI) that can process images and videos in addition to text, making it less reliant on AI startup Anthropic, The Information reported on Wednesday.

Further Reading

Former Google, Stripe Executives Raise $56 Million for AI Agent Startup /dev/agents.

The funding round was co-led by Index Ventures and CapitalG.

Further Reading

Runway announced its new base model for image generation, Frames, this week. The new model represents “a big step forward in stylistic control and visual fidelity,” the AI startup, which develops multimodal AI systems for video, image, and audio generation, said.

Further Reading

Photon Image Model by Luma is a next generation image model built from the ground-up for visual thinking and fast iteration. Luma’s breakthrough new universal architecture enables Photon to generate high resolution, highly detailed, creatively composed images at 8x the efficiency and speed of other comparable models.

Further Reading

The Death Clock app, launched in July, predicts life expectancy using AI and data. With 125,000 downloads, it aids health decisions and financial planning.

Further Reading

Generative artificial intelligence (AI) is reshaping the human resources (HR) profession, offering opportunities to enhance efficiency and infuse processes with a more human touch. According to a new study by Bain & Company, generative AI can help companies save an average of 15-20% in HR labour time through automation and augmentation. This transformative technology has the potential to reduce costs while enabling HR to become a more strategic function within organizations.

Further Reading

A new so-called “reasoning” AI model, QwQ-32B-Preview, has arrived on the scene. It’s one of the few to rival OpenAI’s o1, and it’s the first available to download under a permissive license.

Developed by Alibaba’s Qwen team, QwQ-32B-Preview contains 32.5 billion parameters and can consider prompts up ~32,000 words in length; it performs better on certain benchmarks than o1-preview and o1-mini, the two reasoning models that OpenAI has released so far. (Parameters roughly correspond to a model’s problem-solving skills, and models with more parameters generally perform better than those with fewer parameters. OpenAI does not disclose the parameter count for its models.)

Further Reading

Perplexity, the AI-powered search engine, wants to get into hardware — kinda sorta.

Aravind Srinivas, Perplexity’s founder and CEO, posted on X on Monday that he was considering making a “simple, under $50” device to “reliably answer” questions “voice to voice.” He promised that Perplexity would “definitely” sell such a device if the post got more than 5,000 likes.

Further Reading

Best Article(s) #weekly

Write or Write-Nots by Paul Graham

The reason so many people have trouble writing is that it's fundamentally difficult. To write well you have to think clearly, and thinking clearly is hard.

Not anymore. AI has blown this world open. Almost all pressure to write has dissipated. You can have AI do it for you, both in school and at work.

The result will be a world divided into writes and write-nots. There will still be some people who can write. Some of us like it. But the middle ground between those who are good at writing and those who can't write at all will disappear. Instead of good writers, ok writers, and people who can't write, there will just be good writers and people who can't write.

Read More

Best Open Source Alternatives to Proprietary Software#weekly

MemFree Hybrid AI Search Engine

MemFree Hybrid AI Search Engine

MemFree is a Hybrid AI Search Engine where you can get accurate answers by searching and asking questions with text, images, files, and web pages.

Read More

Latest AI Tools In CogList #weekly

Brevo

Brevo is an all-in-one marketing platform for email marketing, SMS marketing, CRM, and marketing automation.

Brevo Review

Beehiiv

Beehiiv is an all-in-one email marketing platform that manages newsletters, from sending them via email to blogging in general.

Beehiiv Review

Mailchimp

Mailchimp is a marketing platform that offers tools for email marketing and automations, including Email Campaigns, Automation, Analytics, and Website builder.

Mailchimp Review

Linkedin

LinkedIn is a social networking site for professional networking and career enhancement, which connects job seekers, recruiters, and businesses effectively.

Linkedin Review

Pinterest

Pinterest allows users to "pin" images, videos, and other content to virtual boards, from home decor to recipes.

Pinterest Review

Twitter

Twitter is a social networking website for real-time communication and trending updates that enables users to send and read text, photos, videos and links.

Twitter Review

Instagram

Instagram is a social networking site that allows users to share photos and videos, follow other users, and like, comment, or direct message posts.

Instagram Review

Facebook

Facebook is a social networking site where users can connect with friends and family, share content, and be part of many communities based on interest.

Facebook Review

This Week's Summary

This weekly's AI news highlights the remarkable advancements in generative AI, showcasing how these cutting-edge technologies are revolutionizing various industries. NVIDIA's Fugatto model empowers users to create and manipulate audio in unprecedented ways, while Georgia Tech's "Chameleon" AI provides a digital shield against facial recognition, addressing growing privacy concerns. The passage also covers Amazon's Olympus video analysis AI, as well as the significant funding raises fueling the development of AI-powered agents and engagement platforms. These innovations signal a future where generative AI will redefine the boundaries of what's possible, transforming how we interact with technology and the world around us.

Our Rating System

To maintain objectivity and fairness, our news or project selection has not been influenced by any advertisers.