This week started off as a really slow week in AI but as the week progressed more and more news started to come out that got more and more exciting and instead of breaking it down in chronological order as the news happened I want to start off with the good stuff.
The most informational Breakdown of Big AI News
There was a lot of AI-related news this week that you probably missed. Here you can see the Breakdown of Big AI news from The week of June 24th, 2023. Here in this article, I will break down for you the biggest AI news of the week from MidJourney V5.2, Stable Diffusion XL 0.9, Meta Voicebox, and Elon Musk Vs Zuckerberg Cage Match as well as many more like Dropbox Adding AI, YouTube Adding Dubbing, Grammys Allowing AI Music, Celebrities Selling Their AI Likeness, Marvel Secret Invasion Uses AI, ChatGPT Data Leak? and Ilumine AI Part 2.
On Thursday mid-journey announced version 5.2 It got quite a few updates including new Aesthetics more variation in the generations, you can use a new shortened command to find out which words are not really impacting your prompt at all.
But the most interesting new feature in my opinion of the new mid-journey is the zoom-out feature. So if we jump over to Mid Journey here I generated a handful of images of wolves in the snow here
And here’s how you’d use the zoom-out feature. I really like this image number three here Let’s go ahead and upscale number three and as soon as I upscale it you can see some new options here You have a very strong very subtle zoom out 2x zoom out 1.5 X custom Zoom or make Square.
So let’s click the zoom out 2X and see what happens you can see it took that original image and actually made a zoomed-out image with the wolf even further back and you can continue to zoom out
Let’s say I like number three again let’s upscale number three Let’s zoom out two more times. You can see once again it further zoomed out on the image if I take one of these images and click Make Square you can see it turned him into squares and filled in the top and the bottom. This is mid-Journey’s current answer to generative fill in Photoshop and uncropped from stability AI.
let’s also quickly test the shortened feature Let’s take one of these longer prompts that I’ve used it in the past to get these really detailed images
Let’s copy this and let’s use the new shortened prompt and paste this whole prompt in. You can see that it tells me that I don’t need to use the word features I don’t need to use the word lens accent lighting or Global.
If I cut all of those out it shouldn’t really impact the result of the image. So not only will this new update help you zoom further and further out on any image you want which will be really helpful when you get those images with cropped shoulders or cropped head it also helps turn you into a better prompter.
Stable Diffusion XL 0.9
Now this mid-Journey version of Five Point 2 News drops on the Stable Diffusion XL 0.9 exactly the same day that stability AI announced their stable diffusion XL 0.9.
Now a lot of people were speculating that the full version of stable diffusion XL was going to be launched today They gave us 0.9 basically telling us that they don’t feel like the full sdxl 1.0 is fully ready yet. But it does claim to produce massively improved image and composition detail over its predecessor. Here are some examples they give on the website of the beta version of sdxl versus the new 0.9 version of sdxl.
You could see a little bit of improvement in realism and quality here’s a hand holding up a coffee cup. Sdxl 0.9 has one of the largest parameter counts of any open-source image model.
Now they say that the API and the ability to run it from Dream Studio will be available to access on Monday the 26th. You can actually play with it right now inside of clip drop which is one of stability AI’s front-end user interfaces. So if we look at Clip Drop here we can actually test the new version right now You can find it over at Clipdrop.co stable diffusion.
Let’s do the ultimate test a woman holding out her hands it seems like a lot of people are probably using it right now. So it’s loading kind of slow but the first image that it generated uh less than desirable results. Here are some other examples that were presumably generated with the newest version of stable diffusion.
For the most part, they actually look pretty good. In fact, if I scroll down here you can see an image of silhouettes of hands that actually have the proper amount of fingers and don’t look too deformed. This car is looking really good the people are looking really good This is getting much closer to the types of images that we would expect from a MidJourney.
Now stable diffusion is closing that Gap and getting better and better with these types of images. Now, unfortunately, since I’m writing this the day that the newest version of stable diffusion actually dropped the site seems a little overwhelmed right now and I’m struggling to generate more images myself.
But I can get it to generate a few decent images from time to time like this pretty realistic wolf that I just generated it’s getting much much better.
Now this was actually announced last week but it was announced after I Prepared my last blog post. Let’s talk about it now Meta introduced Voice the Box They claim it’s the most versatile AI for speech generation and to me, it sounds very similar to what 11 Labs can do right now, where with just two seconds of audio, it can generate very realistic text-to-speech. Here are some examples from Meta’s website a voice box.
Now I’m unclear if the Mark Zuckerberg’s voice that you’re hearing in this blog post was actually generated with this duel or if this is actually Mark Zuckerberg talking if I had to guess they probably generated this audio with their text-to-speech system because that would be very well, meta.
If you give it text it can read it in a bunch of different styles Penelope Porcupine and Sami slum Penelope Porcupine and Sammy Sloth.
They also say you can use this tool to fix background noise because it knows your voice it can sort of edit out background noise and then replace your voice with the fixed version of the Text-to-Speech voice.
So it can do in-context text-to-speech synthesis using an audio sample as short as two seconds long speech editing and noise reduction cross-lingual style transfer. So you can actually write something out in a different language and it will use your voice but in that language.
It has diverse speech sampling so that you get more variation in the way the voices sound. Now you can learn a lot more about this on Meta’s website I’ll make sure I link up the article here. But as far as us getting access to it they claim they’re sort of holding it back for ethical reasons,
I don’t totally understand that because there are already tools out there that do this like 11 Labs like Uber Duck like the script. So I’m not quite understanding why Meta is holding it back I imagine we’ll probably see this as an open-source model at some point but as to when that’s really anybody’s guess.
Dropbox Adding AI
Now in the news that I find quite exciting this week, Dropbox announced that they’re adding more AI features to their platform.
Dropbox AI is a new feature that lets you instantly summarize
“Dropbox AI is a new feature that lets you instantly summarize and ask questions about your Dropbox files. With Dropbox AI now you can pull up a file you can ask it anything. Dropbox will read the document for you and give you an answer and in a single click you can easily get a summary of the entire 100 page doc. Saving you all the time and extra effort that it would otherwise have taken to do that manual we’re starting with individual files but this is just the first step.”
Now this is interesting to me because there are so many tools out there right now that that is their main selling proposition you upload a PDF and we’ll summarize it for you or you can ask questions about it well.
Now if you have a DropBox account you’re just going to be able to do that I also believe this is something that Google is planning on adding into project Tailwind as well Are features like this from larger incumbent companies just sort of obsoleting a whole bunch of smaller companies right now because of this feature possibly.
He Further said:
“Over time you’ll be able able to ask a question about your Dropbox folders and even your entire Dropbox. We all need a search box for our private information just like Google gave us a search box for public information not only with your Dropbox files but with everything that’s why today we’re introducing Dropbox Dash.”
He further said:
“Dash is an AI powered Universal search engine that connects all of your tools apps and content in a single search bar. Lets you easily find whatever you’re looking for. No matter what app it’s in who sent it to you what you named it how long it’s been since you last looked at it. You can find it super fast.
Dashconnects across your favorite tools like Google workspace HubSpot Asana and notion so that you can access all of your content no matter where it lives or what format it’s in and because Dash is powered by Machine learning it learns and evolves with you and it gets better the more you use it.”
So this is the part that was the most exciting to me because imagine just taking all of your files your documents your videos, your audio pretty much any file type you can imagine.
Plus you have all of your apps that you use Maybe you’ll use Notion maybe you use HubSpot, maybe you use Google Drive and Google Sheets and Google Docs or Asana or any tool like that it sounds like Dropbox is going to tie. This all together into sort of a personalized search engine where you can search all of your files and all of the various tools that you use across various platforms.
As He said Google gives us the search engine to search the web it sounds like Dropbox is trying to give us that search engine to search all of our own personal data. Now I’m not going to show the whole video here but they do say at the end of the video that they promise not to share your data that they’re going to keep it private and that you can trust them.
Although they want you to connect every single tool at every app and every piece of your life to Dropbox they promise that they’re going to keep it safe for you so we’ll see how that plays out.
YouTube Adding Dubbing
Also this week it was announced that YouTube is getting AI-powered dubbing which is absolutely amazing to me because I am a Creator I love making YouTube videos.
Right now the options to watch my videos are either to watch them in English or to turn on subtitles and whatever language is your native language but pretty soon YouTube is just going to automatically overdub into whatever language you want.
So the words that I’m speaking right now in my videos will be coming out overdubbed in whatever language you’re listening to right now.
As per the Verge article,
“YouTube made this announcement on Thursday at VidCon that they’re working with the team from a company called aloud which is an AI power dubbing service from Google’s incubator so basically the tool first transcribes your video you can then review the transcription and edit it it then translates and produces the dub.”
Dubbing with aloud is as simple as editing texts. So if you’re a content creator this is huge news and I’m personally really really excited about this. It’s going to open up all new parts of the world of people that can learn about the types of news and stuff that I’m talking about in AI so really really exciting times to be a content creator.
So that’s some of the Cool Tech that was announced or rolled out this week.
Grammys Allowing AI Music
Now let’s talk about some of the weirder aspects of AI that are happening right now.
This week the recording Academy the organization that runs the Grammys announced their rules about AI-generated music, they said that
“Songs that include elements generated by AI can still be nominated but there must be proof that a real person meaningfully contributed to the song too.
They further said: if there’s an AI Voice singing this song or AI instrumentation we’ll consider it but in a songwriting based category it has to have been written mostly by a human they added that AI will unequivocally shape the future of the music industry and instead of downplaying its significance the Grammy Awards should confront questions related to AI ahead on.
Celebrities Selling Their AI Likeness
Also, I’m not sure if you’ve seen the first episode of Black Mirror yet I’m going to do my best not to share any spoilers about it but if you don’t want to know any information about it whatsoever maybe skip to the next chapter real quick.
The first episode of the newest season of Black Mirror is called Joan is Awful and it involves Ai and deep-faking celebrity likenesses. Let’s just put it that way what’s interesting is the same week that this episode was released.
“Celebrities are attempting to co-opt this AI craze they are making deals with Brands to put AI created duplicates of themselves into marketing campaigns giving them more control over their own likeness and more latitude in the types of Deals they can make.
They further said: Celebrities get paid but they don’t have to turn up Stars just need to spend a few minutes in a studio with a 3D scanner which could then create representations of them for countless hours of content.
And golf legend Jack Nicholas recently agreed with AI company Soul machines to create an ai-powered version of himself at the peak of his career when he was 38 years old the company metaphysic has actually signed on to provide AI services for a forthcoming Robert zemeckis film starring Tom Hanks.”
Now I have given the post a link to this Wall Street Journal article Above. But I highly recommend reading it because if you’ve seen that episode of Black Mirror called Joan is awful. This is literally the news story that’s showing that that plot is literally coming true in real life.
Now there are other aspects to that episode that I don’t feel will really likely play out in real life but again I want to avoid spoilers for anybody who hasn’t seen it yet Definitely go watch that episode if you’re a fan of AI-like I am.
Marvel Secret Invasion Uses AI
Now speaking of the entertainment industry this week Marvel was criticized for using AI to make the Secret Invasion opening credits.
Secret Invasion is a brand new Disney Plus series I haven’t watched it myself yet so I literally can’t spoil it for you what I can tell you is that their new intro which you can see on the screen right now was actually generated with AI tools.
I don’t know exactly which AI tools they used I’m guessing is some form of stable diffusion D Forum or warp fusion or one of those tools that are out there this was done with a combination of visual effects artists and AI over the top of those visual effects.
Now supposedly this was intentionally designed to look unnerving they wanted to give it that AI effect and quite honestly it’s created great publicity around the show because a lot of people are talking about the fact that this intro was AI-generated and making more people aware that there is a brand new Marvel show over on Disney plus right now.
So the marketing behind making this an AI-generated video in my opinion was actually genius.
And the creators of the intro wrote an exclusive to The Hollywood Reporter that the opening that used AI cost no artist jobs:
“Method Studios clarifies reports that sparked a social media backlash stating AI tools complemented and assisted our creative teams.”
If I was the marketing person behind Secret Invasion, I probably would have personally been the one to leak the fact that we used AI to make the intro to stir up controversy only to follow up with a report later that nobody lost their jobs and AI complimented what we created with visual effects because all of that controversy leads to views.
ChatGPT Data Leak?
Now speaking of controversy this week it was reported that the chat GPT user account credentials were found on the dark web
“Chat GPT users should be wary that their personal data might have been leaked online following the dump of more than one hundred thousand chat GPT account credentials on the dark web.However, openai did respond to this saying that it was more like 26 802 the leaked accounts mostly came from the Asia Pacific region and then it was actually due to malware that was installed on a ton of people’s devices and not an actual breach of something at open AI.“
So open AI wasn’t hacked but somehow a lot of people installed something that essentially fished their data.
So if you haven’t been installing any weird Shady software lately or downloading anything that you don’t know where it came from you’re probably in the clear but it’s always a good idea to change up your password from time to time.
Illumine AI Part 2
I want to Circle back around to something that I talked about earlier this week from this company illuminates AI. I actually dubbed in additional footage from this company because I thought what they were making was so cool and I wanted to show you another example of it just real quick because this stuff is just nerd heaven for me.
Here’s an image that we generated in mid-journey illumin AI then turned that image into this 3D world that we’re seeing on the screen.
You can move around and zoom in and zoom out of it and then they were able to drop a character into this 3D scene and actually make the character wander around inside of this 3D World.
This was originally a mid-journey image that we’re looking at that was turned into the 3D world that a character could now run around it like this to me is just so insanely cool that we’re able to do stuff like this. And as of right now this Instaverse tool from Illumine AI is actually free to play around with.
Elon Musk Vs Zuckerberg Cage Match
All right finally to wrap up this week’s news let’s talk about the most bizarre thing that’s going on on the planet right now where Mark Zuckerberg is ready to fight Elon Musk in a cage match what?
So that’s right Meta is actually planning on building a competitor to Twitter Elon Musk has trolled him a little bit on Twitter about this. It’s all sort of escalated into Elon Musk tweeting that he’s up for a cage match with Mark Zuckerberg it further escalated because Mark Zuckerberg screenshotted the Tweet saying I’m up for a cage match.
And then said send me the location now on first view you’re probably thinking this is all just like trolling and goofing off on Twitter. Well, it kind of isn’t because Mark Zuckerberg actually trained in Jiu-Jitsu and claims to train a little bit in MMA as well.
According to Dana White from the UFC they are absolutely dead serious about making this happen now this is a report from TMZ so take that for what it’s worth.
“But in an interview with TMZ on Thursday white said he probably spent an hour and a half on the phone with both of the world’s most high profile Tech billionaires and found out that neither of them was kidding about a proposed cage match stimming from a social media spat. They both said yeah we’ll do it they both want to do it he said Mark Zuckerberg hit me up first and said is Elon serious.
I said I don’t know let me ask him so I asked him and he said yeah I’m dead serious. There’s actually a non-zero percent chance we might see an actual cage match between Elon Musk and Mark Zuckerberg now that’s gonna be interesting.
Anyway, that’s all I got for you in the world of AI when the week started I thought Man this is going to be a slow week there’s really nothing to talk about for Friday’s news blog post but by the time Thursday rolled around.
It really ramped up and I don’t even know what happened on Friday yet because I’m Writing this on Thursday. So there could be new news that I haven’t even reported on yet but if there is I’ll report it in next Friday’s blog post. Also since I’m doing these Friday news blog posts on a regular basis. Now I’m trying to come up with a clever name for what I should call them.
I’ve been thinking about just going along with the name of the AI news you probably haven’t heard of yet because that seems to do well with the YouTube algorithm. But if you have ideas let me know in the comments I’d love to hear them. In this blog website, I’ll keep you in the loop with all the latest AI news all the coolest AI breakthroughs a handful of tutorials from time to time, and give you a grand overview of the TLDR.
Is OpenAI Reveals DALL-E 3 text-to-image model?
Yes, OpenAI is going to reveal the DALL-E 3 text-to-image model to sufficiently understand and analyze user preferences when encountering complex and long prompts. OpenAI’s plan implementation for DALL-E 3 is to include an initial release to ChatGPT ‘Plus’ and ‘Enterprise’ buyers to come a month.
What are the top 5 most prominent AI companies currently?
The top 5 most prominent AI companies with the highest market cap currently in the USA include Microsoft with $2.390+ trillion, Alphabet Google with $1.697+ trillion, NVIDIA with $1.106+ trillion, Tesla with $798.57+ billion and IBM with $128.26+ billion market cap respectively.
Why Big data analytics are the most popular technological advances in the Age of AI?
Big data analytics are the most popular technological advances in the Age of AI because it has a tight connection and perform jointly to build strategic value for companies as well as help businesses to invent unique products and services that satisfy the markets and needs and conditions of their consumers. Also, Big data analytics have huge demand and importance in the AI world due to AI algorithms which is given to Big data analytics to equip the tools and strategies to process, reserve and interpret huge number of data that helps businesses and companies to get helpful understandings insights and patterns.
What are the reasons behind Apple’s decision to hire UK AI staff despite the constant release trend?
The reason behind Apple’s decision to hire UK AI staff despite the constant release trend is that Apple’s company wants to take on better staff to perform in artificial intelligence (AI) as well as redundancies seen across the tech sector. It is more difficult to predict the AI-powered economy of tomorrow so Apple’s company desires to work together on a long-term plan that sets exercise and activities in digital skills and lifelong understanding at the core.