Technology

MM1, a Family of Multimodal AI Models with up to 30 billion Parameters, is being Developed by Apple Researchers

Published

4 months ago

March 18, 2024

In a pre-print paper, Apple researchers presented their work on developing a multimodal large language model (LLM) for artificial intelligence (AI). The paper describes how it was possible to achieve the advanced capabilities of multimodality and train the foundation model on both text-only data and images, and it was published on an online portal on March 14. The Cupertino-based tech giant has made new advances in AI in response to CEO Tim Cook’s statement during the company’s earnings calls, which stated that AI features might be released later this year.

ArXiv, an open-access online repository for scholarly papers, has published the research paper’s pre-print version. Peer review is not, however, applied to the papers that are posted here. The project is thought to be connected to Apple as well, even though the paper makes no mention of the company; this is because the majority of the researchers mentioned are connected to the machine learning (ML) division of Apple.

A family of multimodal models with up to 30 billion parameters, known as MM1, is the project that the researchers are currently working on. The paper’s authors referred to it as a “performant multimodal LLM (MLLM)” and noted that in order to build an AI model that can comprehend both text and image-based inputs, image encoders, the vision language connector, and other architecture elements and data decisions were made.

The paper provided an example in stating that “We demonstrate that achieving state-of-the-art (SOTA) few-shot results across multiple benchmarks, compared to other published pre-training results, requires a careful mix of image-caption, interleaved image-text, and text-only data for large-scale multimodal pre-training.”

To put it simply, the AI model has not received enough training to produce the intended results and is presently in the pre-training phase. This phase involves designing the model’s workflow and data processing eventually using the algorithm and AI architecture. The researchers at Apple were able to incorporate computer vision into the model by means of a vision language connector and image encoders. Upon conducting tests using a combination of image-only, image-text, and text-only data sets, the team discovered that the outcomes were comparable to those of other models at the same stage.

Although this is a significant breakthrough, there is insufficient evidence in this research paper to conclude that Apple will integrate a multimodal AI chatbot into its operating system. It’s difficult to even say at this point whether the AI model is multimodal in terms of receiving inputs or producing output (i.e., whether it can produce AI images or not). However, it can be said that the tech giant has made significant progress toward developing a native generative AI foundation model if the results are verified to be consistent following peer review.

Up Next

NVIDIA Releases 6G Research Cloud Platform to Use AI to Improve Wireless Communications

Don't Miss

Google Gemini may be used by Apple to Power Certain AI Features on the iPhone

Kajal Chavan

Technology

OpenAI Launches SearchGPT, a Search Engine Driven by AI

Published

17 hours ago

July 26, 2024

Archana Suryawanshi

The highly anticipated launch of SearchGPT, an AI-powered search engine that provides real-time access to information on the internet, by OpenAI is being made public.

“What are you looking for?” appears in a huge text box at the top of the search engine. However, SearchGPT attempts to arrange and make sense of the links rather than just providing a bare list of them. In one instance from OpenAI, the search engine provides a synopsis of its discoveries regarding music festivals, accompanied by succinct summaries of the events and an attribution link.

Another example describes when to plant tomatoes before decomposing them into their individual types. You can click the sidebar to access more pertinent resources or pose follow-up questions once the results are displayed.

At present, SearchGPT is merely a “prototype.” According to OpenAI spokesman Kayla Wood, the service, which is powered by the GPT-4 family of models, will initially only be available to 10,000 test users. According to Wood, OpenAI uses direct content feeds and collaborates with outside partners to provide its search results. Eventually, the search functions should be integrated right into ChatGPT.

It’s the beginning of what may grow to be a significant challenge to Google, which has hurriedly integrated AI capabilities into its search engine out of concern that customers might swarm to rival firms that provide the tools first. Additionally, it places OpenAI more squarely against Perplexity, a business that markets itself as an AI “answer” engine. Publishers have recently accused Perplexity of outright copying their work through an AI summary tool.

OpenAI claims to be adopting a notably different strategy, suggesting that it has noticed the backlash. The business highlighted in a blog post that SearchGPT was created in cooperation with a number of news partners, including businesses such as Vox Media, the parent company of The Verge, and the owners of The Wall Street Journal and The Associated Press. “News partners gave valuable feedback, and we continue to seek their input,” says Wood.

According to the business, publishers would be able to “manage how they appear in OpenAI search features.” They still appear in search results, even if they choose not to have their content utilized to train OpenAI’s algorithms.

According to OpenAI’s blog post, “SearchGPT is designed to help users connect with publishers by prominently citing and linking to them in searches.” “Responses have clear, in-line, named attribution and links so users know where information is coming from and can quickly engage with even more results in a sidebar with source links.”

OpenAI gains from releasing its search engine in prototype form in several ways. Additionally, it’s possible to miscredit sources or even plagiarize entire articles, as Perplexity was said to have done.

There have been rumblings about this new product for several months now; in February, The Information reported on its development, and in May, Bloomberg reported even more. A new website that OpenAI has been developing that made reference to the transfer was also seen by certain X users.

ChatGPT has been gradually getting closer to the real-time web, thanks to OpenAI. The AI model was months old when GPT-3.5 was released. OpenAI introduced Browse with Bing, a method of internet browsing for ChatGPT, last September; yet, it seems far less sophisticated than SearchGPT.

OpenAI’s quick progress has brought millions of users to ChatGPT, but the company’s expenses are mounting. According to a story published in The Information this week, OpenAI’s expenses for AI training and inference might total $7 billion this year. Compute costs will also increase due to the millions of people using ChatGPT’s free edition. When SearchGPT first launches, it will be available for free. However, as of right now, it doesn’t seem to have any advertisements, so the company will need to find a way to make money soon.

Technology

Google Revokes its Intentions to stop Accepting Cookies from Marketers

Published

4 days ago

July 23, 2024

Archana Suryawanshi

Following years of delay, Google has announced that it will no longer allow advertisers to remove and replace third-party cookies from its Chrome web browser.

Cookies are text files that websites upload to a user’s browser so they can follow them around when they visit other websites. A large portion of the digital advertising ecosystem has been powered by this practice, which makes it possible to track people across many websites in order to target ads.

Google stated in 2020 that it would stop supporting certain cookies by the beginning of 2022 after determining how to meet the demands of users, publishers, and advertisers and developing solutions to make workarounds easier.

In order to do this, Google started the “Privacy Sandbox” project in an effort to find a way to safeguard user privacy while allowing material to be freely accessible on the public internet.

In January, Google declared that it was “extremely confident” in the advancement of its plans to replace cookies. One such proposal was “Federated Learning of Cohorts,” which would essentially group individuals based on similar browsing habits; thus, only “cohort IDs”—rather than individual user IDs—would be used to target them.

However, Google extended the deadline in June 2021 to allow the digital advertising sector more time to finalize strategies for better targeted ads that respect user privacy. Then, in 2022, the firm stated that feedback had indicated that advertisers required further time to make the switch to Google’s cookie replacement because some had resisted, arguing that it would have a major negative influence on their companies.

The business announced in a blog post on Monday that it has received input from regulators and advertisers, which has influenced its most recent decision to abandon its intention to remove third-party cookies from its browser.

According to the firm, testing revealed that the change would affect publishers, advertisers, and pretty much everyone involved in internet advertising and would require “significant work by many participants.”

Anthony Chavez, vice president of Privacy Sandbox, commented, “Instead of deprecating third-party cookies, we would introduce a new experience in Chrome that lets people make an informed choice that applies across their web browsing, and they’d be able to adjust that choice at any time.” “We’re discussing this new path with regulators and will engage with the industry as we roll it out.”

Technology

Samsung Galaxy Buds 3 Pro Launch Postponed Because of Problems with Quality Control

Published

7 days ago

July 20, 2024

Archana Suryawanshi

At its Unpacked presentation on July 10, Samsung also debuted its newest flagship buds, the Galaxy Buds 3 Pro, with the Galaxy Z Fold 6, Flip 6, and the Galaxy Watch 7. Similar to its other products, the firm immediately began taking preorders for the earphones following the event, and on July 26th, they will go on sale at retail. But the Korean behemoth was forced to postpone the release of the Galaxy Buds 3 Pro and delay preorder delivery due to quality control concerns.

The Galaxy Buds 3 Pro went on sale earlier this week in South Korea, Samsung’s home market, in contrast to the rest of the world. However, allegations of problems with quality control quickly surfaced. These included loose case hinges, earbud joints that did not sit flush, blue dye blotches, scratches or scuffs on the case cover, and so on. It appears that the issues are exclusive to the white Buds 3 Pro; the silver devices are working fine.

Samsung reportedly sent out an email to stop selling Galaxy Buds 3 Pros, according to a Reddit user. These problems appear to be a result of Samsung’s inadequate quality control inspections. Numerous user complaints can also be found on its Korean community forum, where one consumer claims that the firm would enhance quality control and reintroduce the earphones on July 24.

A Samsung official stated. “There have been reports relating to a limited number of early production Galaxy Buds 3 Pro devices. We are taking this matter very seriously and remain committed to meeting the highest quality standards of our products. We are urgently assessing and enhancing our quality control processes.”

“To ensure all products meet our quality standards, we have temporarily suspended deliveries of Galaxy Buds 3 Pro devices to distribution channels to conduct a full quality control evaluation before shipments to consumers take place. We sincerely apologize for any inconvenience this may cause.”

Should Korean customers encounter problems with their Buds 3 Pro devices after they have already received them, they should bring them to the closest service center for a replacement.

Possible postponement of the US debut of the Galaxy Buds 3 Pro

Samsung seems to have rescheduled the launch date and (some) presale deliveries of the Galaxy Buds 3 Pro in the US and other markets by one month. Inspect your earbuds carefully upon delivery to make sure there are no issues with quality control, especially if your order is still scheduled for July.

The Buds 3 Pro is currently scheduled for delivery in late August, one month after its launch date, on the company’s US store. Additionally, Best Buy no longer takes preorders for the earphones, and Amazon no longer lists them for sale.

There are no quality control difficulties affecting the Buds 3, and they are still scheduled for delivery by July 24, the day of launch. Customers of the original Galaxy Buds 3 Pro have reported that taking them out is easy to tear the ear tips. Samsung’s delay, though, doesn’t seem to be related to that issue.