Connect with us

Technology

The Three Biggest Advancements in AI for 2023

Published

on

The Three Biggest Advancements in AI for 2023

In many ways, the year 2023 marked the beginning of people’s understanding of artificial intelligence (AI) and its potential. That was the year governments started to take AI risk seriously and the year chatbots went viral for the first time. These advancements weren’t so much new inventions as they were concepts and technologies that were coming of age after a protracted gestation period.

However, there were also a lot of fresh inventions. These are the top three from the previous year:

Differentiation

Although the term “multimodality” may sound technical, it’s important to know that it refers to an AI system’s capacity to handle a wide variety of data types, including audio, video, images, and text.

This year marked the first time that robust multimodal AI models were made available to the general public. The first of these, GPT-4 from OpenAI, let users upload images in addition to text inputs. With its ability to “see” images, GPT-4 offers up a plethora of possibilities. For instance, you could ask it to decide what to have for dinner based on a picture of what’s in your refrigerator. OpenAI released the capability for users to communicate with ChatGPT via voice and text in September.

Announced in December, Google DeepMind’s most recent model, Gemini, is also capable of processing audio and images. In a Google launch video, the model was shown using a post-it note with a line drawing to identify a duck. In the same video, Gemini came up with an image of a pink and blue plush octopus after being shown a picture of pink and blue yarn and asked what they could make. (The promotional film gave the impression that Gemini was watching moving images and reacting to voice commands in real time. However, Google stated in a blog post on its website that the video had been trimmed for brevity and that the model was being prompted with text prompts rather than audio and still images, even though the model does have

“I think the next landmark that people will think back to, and remember, is [AI systems] going much more fully multimodal,” Google DeepMind co-founder Shane Legg said on a podcast in October. “It’s early days in this transition, and when you start really digesting a lot of video and other things like that, these systems will start having a much more grounded understanding of the world.” In an interview with TIME in November, OpenAI CEO Sam Altman said multimodality in the company’s new models would be one of the key things to watch out for next year.

Multimodality offers benefits beyond making models more practical. The models can also be trained on a wealth of new data sets, including audio, video, and images, which together contain more information about the world than text can. Many of the world’s leading AI companies hold the view that these models will become more powerful or capable as a result of this new training data. It is a step toward “artificial general intelligence,” the kind of system that can equal human intellect, producing labor that is economically valuable and leading to new scientific discoveries. This is the hope held by many AI scientists.

AI under the Constitution

How to integrate AI with human values is one of the most important unsolved issues in the field. If artificial intelligence and power surpass that of humans, these systems have the potential to unleash immense damage on our species—some even predict its extinction—unless they are somehow restrained by laws that prioritize human well-being.

The method that OpenAI employed to align ChatGPT (in order to steer clear of the racist and sexist tendencies of previous models) was successful, but it necessitated a significant amount of human labor. This method is called “reinforcement learning with human feedback,” or RLHF. If the AI’s response was beneficial, safe, and adhered to OpenAI’s list of content guidelines, human raters would evaluate it and award it the computational equivalent of a dog treat. OpenAI created a reasonably safe and efficient chatbot by rewarding the AI for good behavior and punishing it for bad behavior.

However, the RLHF process’s scalability is seriously questioned due to its heavy reliance on human labor. It costs a lot. It is susceptible to the prejudices or errors committed by certain raters. The longer the list of rules, the greater the likelihood of failure. And it doesn’t seem like it will work for AI systems that get so strong that they start doing things that are incomprehensible to humans.

Constitutional AI, which was initially introduced in a December 2022 paper by researchers at the prestigious AI lab Anthropic, aims to solve these issues by utilizing the fact that AI systems are now able to comprehend natural language. The concept is very straightforward. You start by creating a “constitution” that outlines the principles you want your AI to uphold. Subsequently, the AI is trained to grade responses according to how closely they adhere to the constitution. The model is then given incentives to produce responses that receive higher scores. Reward learning from AI feedback has replaced reinforcement learning from human feedback. The Anthropic researchers stated that “these methods make it possible to control AI behavior more precisely and with far fewer human labels.” Anthropic’s 2023 response to ChatGPT, Claude, was aligned using constitutional AI. (Among the investors in Anthropic is Salesforce, whose CEO and co-chair of TIME is Marc Benioff.)

“With constitutional AI, you’re explicitly writing down the normative premises with which your model should approach the world,” Jack Clark, Anthropic’s head of policy, told TIME in August. “Then the model is training on that.” There are still problems, like the difficulty of making sure the AI has understood both the letter and the spirit of the rules, (“you’re stacking your chips on a big, opaque AI model,” Clark says,) but the technique is a promising addition to a field where new alignment strategies are few and far between.

Naturally, Constitutional AI does not address the issue of whose values AI ought to be in line with. However, Anthropic is attempting to make that decision more accessible to all. The lab conducted an experiment in October wherein it asked a representative sample of one thousand Americans to assist in selecting rules for a chatbot. The results showed that, despite some polarization, it was still possible to draft a functional constitution based on statements that the group reached a consensus on. These kinds of experiments may pave the way for a time when the general public has far more influence over AI policy than it does now, when regulations are set by a select group of Silicon Valley executives.

Text to Video

The rapidly increasing popularity of text-to-video tools is one obvious result of the billions of dollars that have been invested in AI this year. Text-to-image technologies had just begun to take shape a year ago; today, a number of businesses are able to convert sentences into moving pictures with ever-increasing precision.

One of those businesses is Runway, an AI video startup with offices in Brooklyn that aims to enable anyone to make movies. With its most recent model, Gen-2, users can perform video-to-video editing—that is, altering an already-existing video’s style in response to a text prompt, such as transforming a picture of cereal boxes on a tabletop into a nighttime cityscape.

“Our mission is to build tools for human creativity,” Runway’s CEO Cristobal Valenzuela told TIME in May. He acknowledges that this will have an impact on jobs in the creative industries, where AI tools are quickly making some forms of technical expertise obsolete, but he believes the world on the other side is worth the upheaval. “Our vision is a world where human creativity gets amplified and enhanced, and it’s less about the craft, and the budget, and the technical specifications and knowledge that you have, and more about your ideas.” (Investors in Runway include Salesforce, where TIME co-chair and owner Marc Benioff is CEO.)

Pika AI, another startup in the text-to-video space, claims to be producing millions of new videos every week. The startup, which is headed by two Stanford dropouts, debuted in April but has already raised money valued at between $200 and $300 million, according to Forbes. Free tools like Pika, aimed more at the average user than at professional filmmakers, are attempting to change the face of user-generated content. Though text-to-video tools are computationally expensive, don’t be shocked if they start charging for access once the venture capital runs out. That could happen as soon as 2024.

Technology

Sony has Revealed the PlayStation 5, PS5 Pro, and a Limited Edition PSOne-Style Device

Published

on

In the PlayStation video below, the collection was unveiled along with a limited-edition DualSense, DualSense Edge, and PlayStation Portal. All products will be available for preorder starting on September 26 and will ship on November 21.

There are just 12,300 PS5 Pro bundles available worldwide, making it probably the most limited edition. Included in the bundle are the DualSense Edge controller and case in the iconic PlayStation design, along with the PS5 Pro itself, a vertical stand, a charging station, and other accessories. Along with four cable ties designed in the image of the sacred symbols, an original PlayStation controller-style cable connector, a sticker, poster, and paperclip are also included.

The PlayStation 5 Slim Digital Edition is the other console that is available; however, if you want to utilize discs on the retro-themed console, you will still need to purchase a matching disc drive cover. Along with four cable ties designed to resemble sacred symbols, a sticker, poster, paperclip, and the original PlayStation controller-style cable connector, it also includes a standard DualSense controller and a vertical stand.

Sony withheld the pricing of any of the collection’s products, but enthusiasts who aren’t willing to shell out a lot of cash will be glad to hear that the DualSense controller and DualSense Edge will be sold individually.

As stated on the PlayStation Blog, preorders will be accessible to PlayStation Network account holders via PlayStation Direct on September 26. Additionally, between September 26 and October 10, participating shops will be taking preorders for the DualSense controller and the PS5 Digital Edition. On September 26, those who wish to preorder PlayStation Direct in nations where it is not accessible can do so at participating retailers.

At a technical conference on September 10, PlayStation unveiled the PS5 Pro and its $699.99 price tag. Those who like to use discs and stand their PS5 Pro upright will have to shell out approximately $805 since the regular model does not come with a disc drive or vertical stand.

Continue Reading

Technology

Google experiments with Android tablets’ desktop windowing

Published

on

Google is testing a new feature for Android tablets that would allow you to easily rearrange apps on your screen and resize them, which will facilitate multitasking. Developer previews of the “desktop windowing” functionality are now accessible, and you can even run multiple instances of the app simultaneously if they support it.

At the moment, Android tablet apps always open in full screen mode. Each program will show up in a window with controls to let you move, maximize, or close it when the new mode is enabled. Moreover, your open programs will be listed in a taskbar at the bottom of the screen.

It sounds a lot like Stage Manager for the iPad, which allows you to do the same with windows on your screen, or with almost any desktop operating system. For years, Samsung has also provided its DeX experience, which gives Android apps on Galaxy phones and tablets desktop-like window management.

When the functionality becomes available to all users, you may activate it by tapping and holding the window handle located at the top of an application’s screen. The shortcut meta key (Windows, Command, or Search) + Ctrl + Down can also be used to enter desktop mode if a keyboard is connected. (You can drag a window to the top of your screen to dismiss the mode, or you can close all of your open apps.)

Apps that are locked to portrait orientation can still be resized, according to Google, which could have odd visual effects if some apps aren’t optimized. Google intends to fix this in a later release, though, by scaling non-resizable apps’ user interfaces without changing their aspect ratios.

For the time being, users with the most recent Android 15 QPR1 Beta 2 for Pixel Tablets can access the developer preview.

Continue Reading

Technology

Sony Faces Backlash for Pricing PlayStation 5 Pro Well Above Xbox

Published

on

Sony Group Corp. has set the price of its new, faster PlayStation 5 Pro at $700, significantly higher than Microsoft’s Xbox Series X, which costs $600. The PlayStation 5 Pro, launching on November 7, comes at a $200 premium over the original PS5, suggesting Sony is targeting a loyal audience willing to pay extra for enhanced performance.

This pricing positions both Sony and Microsoft at the high end of the gaming console market. Four years into their product life cycles, the two most popular home consoles are moving towards premium models. Analysts are split on whether Sony’s pricing strategy will drive sales, especially as it seeks to grow its entertainment portfolio across gaming, anime, and film.

Industry analyst Serkan Toto described the PlayStation 5 Pro as a niche device aimed at hardcore PlayStation users, rather than a mass-market offering. “It’s about Sony skimming the absolute top end of the market,” he said, with the gaming world questioning Sony’s high pricing.

Others speculate that Sony’s pricing strategy is aimed at boosting margins, particularly after recent price hikes in Japan due to rising component costs like chips. The new console will allow for higher resolution and faster frame rates without requiring users to switch between performance modes, delivering 45% faster rendering than the standard PS5, according to lead architect Mark Cerny.

Despite the steep price, some analysts believe Sony could benefit. Citi analyst Kota Ezawa pointed out that no previous game console successor has been priced significantly higher than the original model, and that the PS5 Pro’s improved components may not justify such a big price jump. Nevertheless, the higher price could enhance Sony’s gross margins.

The PlayStation 5, which has sold over 59 million units since its 2020 release, has slightly lagged behind the PlayStation 4. The increased cost of the PS5 Pro may narrow its appeal, as the price edges closer to that of a gaming PC—one of the console market’s biggest competitors.

Reviewers also highlighted the lack of a disc drive in the new model, reflecting a broader industry shift from physical media to digital content. A disc drive will be available separately for purchase.

In a blog post, Sony announced that the PS5 Pro would enhance the performance of older titles, with several popular games such as Hogwarts Legacy, Final Fantasy VII Rebirth, and Spider-Man 2 receiving free updates to take advantage of the console’s new features.

Continue Reading

Trending

error: Content is protected !!