Technology

Visual Electric launches to free AI art creation from chat interfaces

If you have tried at least a few of the text-to-image AI art generation services introduced in recent years, such as Midjourney or the various versions of OpenAI’s DALL-E, you have probably noticed some similarities. The most notable is that they all resemble chat interfaces: the user types a text prompt, and the application typically responds with an image embedded in a message.

While this interface works well for some users and application developers, some people believe it is limiting and ultimately not what established artists and designers want when using AI at work. Now San Francisco-based Visual Electric is here to offer a different approach. The new startup, which emerges from stealth today following a seed round of an undisclosed amount last year from Sequoia, BoxGroup, and Designer Fund, believes its approach is better suited to visual creativity than texting back and forth with an AI model.

“There’s just so many workflow-specific optimizations that you need to make if you’re a graphic designer or a concept artist,” said Colin Dunn, founder and CEO of Visual Electric, in an exclusive interview with VentureBeat. “There’s a long tail of things that will make their life way easier and will make for a much better product.”

Dunn previously led product design and brand at the mobile website building company Universe, and before that served as head of design at Playspace, a Google acquisition.

For enterprise users, such as independent designers, in-house designers at major brands, and even “pro-sumers,” Visual Electric aims to be that “much better product” for AI art, visual design, and creativity.

The company is deliberately not launching its own underlying AI image generation machine learning (ML) model. Instead, it is built on the open-source Stable Diffusion XL model, which is currently the subject of a copyright lawsuit brought by artists against Stability AI, the company that developed it, as well as Midjourney and other AI art generators.

That is because Dunn and his two co-founders believe image generation AI models are being commoditized, and that the front-end user interface will largely determine which businesses succeed and which fail. The co-founders are Adam Menges, chief product officer of Visual Electric and a former co-founder of Microsoft acquisition Lobe, and chief technology officer Zach Stiggelbout, who previously worked at Lobe.

“We just want to build the best product experience,” Dunn said. “We’re really model agnostic and we’re happy to swap out whatever model is going to give users the best results. Our product can easily accommodate multiple models or the next model that’s going to come out.”

What sets Visual Electric apart from Midjourney, DALL-E 3, and other AI art apps?

What sets Visual Electric apart from previous image generators? Instead of the top-to-bottom “linear” form factor of other chat-based AI art generator apps, which force users to scroll back up to see their previous generations, it allows users to generate imagery and drag it around an infinite virtual “canvas.” Users can keep generating new sets of four images at a time and move them around this canvas wherever they’d like.

“Creativity is a nonlinear process,” Dunn said. “You want to explore; you want to go down different paths and then go back up to an idea you were looking at previously and take that in a new direction. Chat forces you into this very linear flow where it’s sort of like you have a starting point and an ending point. And that’s just not really how creativity works.”

There is still a box for entering text prompts, but unlike in many chat interfaces, it sits at the top of the screen instead of the bottom.

Some users face an initial hurdle: not knowing exactly what to type to prompt the AI into producing the image they have in their mind’s eye. To help with this, Visual Electric offers a drop-down field of autocomplete suggestions, similar to what a user sees when typing a search on Google. All of these suggestions are based on what Visual Electric has observed from early users and what produces the best images. However, a user is also free to ignore them entirely and type in a custom prompt.

Moreover, Visual Electric’s web-based AI art generator offers a range of helpful additional tools for modifying the prompt and style of the resulting images. These include pre-set styles that mimic common ones from the pre-AI digital and printed art worlds, such as “marker,” “classic animation,” “3D render,” “airbrush,” “risograph,” “stained glass,” and many others, with new styles being added regularly.

Rather than having to specify the image aspect ratio within the prompt text, the user can select it from buttons on the dropdown or in a convenient right-rail sidebar, with common options such as 16:9 and 5:4. This puts Visual Electric in more direct competition with Adobe’s Firefly 2 AI art interface, which offers similar functionality.

This sidebar also lets the user specify dominant colors and elements they wish to exclude from the resulting AI-generated image, likewise entered through text.

In addition, the user can click a button to “remix” or “regenerate” their images based on their initial prompt. They can also “touch up” specific areas of an image: the user highlights those areas with a digital brush of adjustable size, and the AI regenerates only the highlighted portion in the same style while keeping the rest of the image intact. So, for instance, if you didn’t like the hair of your AI-generated subject, you could “touch up” the hair and instruct the Stable Diffusion XL model to redo only that portion of the image.

Additionally, there is a built-in upscaler that can improve image resolution and detail.

“These are the tools that represent what we see as the AI-native workflow and they [are] in the order that you use them,” Dunn said.

Pricing, the community, and early success stories

Although Visual Electric is launching publicly today, the company has been quietly conducting alpha testing with a few dozen designers. Dunn says these designers have already provided valuable feedback that will help improve the product. He adds that promising results from real-world enterprise workplace use show that the company is on the right track.

Dunn mentioned one customer in particular, withholding the name for confidentiality, that had a small team of designers working to create menus and other visual collateral for more than 600 universities.

Previously, this team would have spent much of its time sorting through stock imagery, trying to find images that matched one another yet also accurately represented the items on a school’s dining hall menu, and then manually editing the stock imagery to make it more accurate.

With Visual Electric, they can now create brand-new images that meet the requirements of the menu and edit portions of them without using Adobe Photoshop or other alternatives.

“They’re now able to take what was a non-creative task and make it into something that is very creative, much more fulfilling, and they can do it in a tenth of the time,” Dunn claimed.

Another important feature that Visual Electric offers is an “Inspiration” feed of AI-generated images created on the platform by other users. This feed, a grid of differently sized images reminiscent of Pinterest, lets the user hover over the images and see their prompts. Users can also import any image from the public feed into their private canvas by “remixing” it.

“This was an early decision that we made, which is we think that with generative AI there’s an opportunity to bring the network into the tool,” Dunn explained. “Right now, you have inspiration sites like Pinterest and designer-specific sites like Dribbble, and then you have the tools like Photoshop, Creative Suite and Figma. It’s always felt odd to me that these things are not unified in some way, because they’re so related to each other.”

Users of Visual Electric can choose whether to engage with this feed and contribute to it, at their discretion. For enterprises concerned about the privacy of their imagery and works in progress, Dunn assured VentureBeat that the company takes privacy and security seriously. However, only the “Pro” plan offers the ability to store images privately; everything else is public by default.

Visual Electric launches publicly in the U.S. today with the following pricing: a free plan that offers 40 generations per day at slower speeds and a license for personal use only (you can’t sell the images or use them for marketing); a standard plan at $20 per month, or $16 per month paid annually up front, which allows community sharing, unlimited generations at 2x faster speeds, and a royalty-free commercial use license; and a pro plan at $60 per month, or $48 per month paid annually up front, which offers everything in the other two plans plus even higher-resolution images and, critically, private generations.

AI Features of the Google Pixel 8a Leaked before the Device’s Planned Release

Google is expected to unveil a new smartphone at its I/O conference on May 14–15. The forthcoming device, dubbed the Pixel 8a, will be a toned-down version of the Pixel 8. Although the phone has surfaced online repeatedly, Google has not yet officially announced it. Now, just weeks before its much-anticipated release, a leaked promotional video showcases the Pixel 8a’s AI features. Other internet leaks have also disclosed software support details and special features.

Tipster Steve Hemmerstoffer obtained a promotional video for the Pixel 8a through MySmartPrice. The forthcoming smartphone is expected to include certain Pixel-only features, some of which are demonstrated in the video. According to the video, the Pixel 8a will support Google’s Best Take feature, which draws on multiple group or burst photos to “replace” faces that have their eyes closed or show undesirable expressions.

The Pixel 8a will also support Circle to Search, a feature currently available on some Pixel and Samsung Galaxy smartphones. Additionally, the leaked video implies that the smartphone will come equipped with Google’s Audio Magic Eraser, an artificial intelligence (AI) tool for removing unwanted background noise from recorded videos. The video also shows that the Pixel 8a will support live translation during voice calls.

According to the leaked teasers, the phone will have “seven years of security updates” and the Tensor G3 chip. It’s unclear, though, whether the phone will get the same number of Android OS updates as the more expensive Pixel 8 series phones that use the same processor. The company is expected to disclose additional information about the device in the days leading up to its planned May 14 launch.

Apple Unveils a new Artificial Intelligence Model Compatible with Laptops and Phones

All of the major tech companies, with the exception of Apple, have made their generative AI models available for commercial use. The company is nevertheless actively working in that area. On Wednesday, its researchers released Open-source Efficient Language Models (OpenELM), a collection of four very small language models, on the Hugging Face model library. According to the company, OpenELM works very well for text-related tasks like composing emails. The company has kept the models open source, and they are now ready for developers to use.

The models are extremely small compared with those from other tech giants like Microsoft and Google. Apple’s latest models have 270 million, 450 million, 1.1 billion, and 3 billion parameters. By comparison, Google’s Gemma model has 2 billion parameters, and Microsoft’s Phi-3 model has 3.8 billion. Small models like these can run on phones and laptops and require less power to operate.

Apple CEO Tim Cook hinted in February at the impending release of generative AI features on Apple products. He said that Apple has been working on this project for a long time. However, no further details about the AI features are available.

Apple, meanwhile, has announced that it will hold an event to introduce a few new products this month. Media invites to the “special Apple Event” on May 7 at 7 AM PT (7:30 PM IST) have already begun to arrive from the company. The invite’s image, which shows an Apple Pencil, suggests that the event will primarily focus on iPads.

It seems that Apple will host the event entirely online, following in the footsteps of October’s “Scary Fast” event. Every invitation that Apple has sent out implies that viewers will be able to watch the event online. Invitations for an in-person event have not yet been distributed.

Apple has released other AI models before this one. The company previously released the MGIE image editing model, which enables users to edit photos using prompts.

Google Expands the Availability of AI Support with Gemini AI to Android 10 and 11

Google’s Gemini AI, previously limited to Android 12 and above, is now compatible with Android 10 and 11. As noted by 9to5Google, this change greatly expands the pool of users who can take advantage of AI-powered support on their tablets and smartphones.

With a recent app update, Google has lowered the minimum requirement for Gemini, making its advanced AI features accessible to a wider range of users. Previously, Gemini required Android 12 or later to function. The AI assistant can now be installed and used on Android 10 devices thanks to the updated Gemini app, version 1.0.626720042, which can be downloaded from the Google Play Store.

The expansion was first mentioned by Sumanta Das on X and then further highlighted by Artem Russakovskii. When Gemini was first released earlier this year, it was compatible only with the most recent versions of Android. The latest update reflects Google’s stated goal of making AI technology more inclusive and expanding the user base for it.

According to testers using Android 10 devices, Gemini is fully operational after updating the Google app and Play Services. Tests conducted on a Google Pixel running Android 10 showed that Gemini functions seamlessly and offers a user experience akin to that on more recent models.

The wider compatibility matters for users with older Android devices, who will now have access to the same AI capabilities as those with more recent models.

Users of Android 10 and 11 can now access Gemini, and they can anticipate regular updates and new features. This action marks a significant turning point in Google’s AI development and opens the door for future functional and accessibility enhancements, improving everyone’s Android experience.
