Technology

Nvidia Unveils NIM for Seamless Deployment of AI Models in Production

Published

4 months ago

March 19, 2024

Nvidia unveiled Nvidia NIM, a new software platform intended to speed up the deployment of personalized and pre-trained AI models into production environments, at its GTC conference today. By combining a model with an optimized inferencing engine and packing it into a container that can be accessed as a microservice, NIM takes the software work that Nvidia has done around inferencing and optimizing models and makes it easily accessible.

According to Nvidia, if the company had any internal AI talent at all, it would normally take developers weeks, if not months, to ship similar containers. For businesses looking to accelerate their AI roadmap, Nvidia’s NIM clearly aims to build an ecosystem of AI-ready containers that use its hardware as the base layer and these carefully chosen microservices as the main software layer.

Currently, NIM supports open models from Google, Hugging Face, Meta, Microsoft, Mistral AI, Stability AI, A121, Adept, Cohere, Getty Images, and Shutterstock in addition to models from NVIDIA. To make these NIM microservices available on SageMaker, Kubernetes Engine, and Azure AI, respectively, Nvidia is already collaborating with Amazon, Google, and Microsoft. Additionally, they’ll be incorporated into LlamaIndex, LangChain, and Deepset frameworks.

In a press conference held prior to today’s announcements, Manuvir Das, Nvidia’s head of enterprise computing, stated, “We believe that the Nvidia GPU is the best place to run inference of these models on […] and we believe that NVIDIA NIM is the best software package, the best runtime, for developers to build on top of so that they can focus on the enterprise applications — and just let Nvidia do the work to produce these models for them in the most efficient, enterprise-grade manner, so that they can just do the rest of their work.”“

TensorRT, TensorRT-LLM, and Triton Inference Server will be the inference engines used by Nvidia. Nvidia microservices that will be made available via NIM include the Earth-2 model for weather and climate simulations, cuOpt for routing optimizations, and Riva for customizing speech and translation models.

The Nvidia RAG LLM operator, for instance, will soon be available as a NIM, a move that the company hopes will simplify the process of creating generative AI chatbots that can extract unique data.

Without a few announcements from partners and customers, this wouldn’t be a developer conference. Presently, NIM’s clientele includes companies like Box, Cloudera, Cohesity, Datastax, Dropbox, and NetApp.

NVIDIA founder and CEO Jensen Huang stated, “Established enterprise platforms are sitting on a goldmine of data that can be transformed into generative AI copilots.” “These containerized AI microservices, developed with our partner ecosystem, are the building blocks for enterprises in every industry to become AI companies.”

Up Next

Mustafa Suleyman, a Cofounder of DeepMind, is Appointed CEO of Microsoft AI by Satya Nadella

Don't Miss

AWS and Nvidia Collaborate on AI Advancement Infrastructure

Kajal Chavan

Technology

OpenAI Launches SearchGPT, a Search Engine Driven by AI

Published

24 hours ago

July 26, 2024

Archana Suryawanshi

The highly anticipated launch of SearchGPT, an AI-powered search engine that provides real-time access to information on the internet, by OpenAI is being made public.

“What are you looking for?” appears in a huge text box at the top of the search engine. However, SearchGPT attempts to arrange and make sense of the links rather than just providing a bare list of them. In one instance from OpenAI, the search engine provides a synopsis of its discoveries regarding music festivals, accompanied by succinct summaries of the events and an attribution link.

Another example describes when to plant tomatoes before decomposing them into their individual types. You can click the sidebar to access more pertinent resources or pose follow-up questions once the results are displayed.

At present, SearchGPT is merely a “prototype.” According to OpenAI spokesman Kayla Wood, the service, which is powered by the GPT-4 family of models, will initially only be available to 10,000 test users. According to Wood, OpenAI uses direct content feeds and collaborates with outside partners to provide its search results. Eventually, the search functions should be integrated right into ChatGPT.

It’s the beginning of what may grow to be a significant challenge to Google, which has hurriedly integrated AI capabilities into its search engine out of concern that customers might swarm to rival firms that provide the tools first. Additionally, it places OpenAI more squarely against Perplexity, a business that markets itself as an AI “answer” engine. Publishers have recently accused Perplexity of outright copying their work through an AI summary tool.

OpenAI claims to be adopting a notably different strategy, suggesting that it has noticed the backlash. The business highlighted in a blog post that SearchGPT was created in cooperation with a number of news partners, including businesses such as Vox Media, the parent company of The Verge, and the owners of The Wall Street Journal and The Associated Press. “News partners gave valuable feedback, and we continue to seek their input,” says Wood.

According to the business, publishers would be able to “manage how they appear in OpenAI search features.” They still appear in search results, even if they choose not to have their content utilized to train OpenAI’s algorithms.

According to OpenAI’s blog post, “SearchGPT is designed to help users connect with publishers by prominently citing and linking to them in searches.” “Responses have clear, in-line, named attribution and links so users know where information is coming from and can quickly engage with even more results in a sidebar with source links.”

OpenAI gains from releasing its search engine in prototype form in several ways. Additionally, it’s possible to miscredit sources or even plagiarize entire articles, as Perplexity was said to have done.

There have been rumblings about this new product for several months now; in February, The Information reported on its development, and in May, Bloomberg reported even more. A new website that OpenAI has been developing that made reference to the transfer was also seen by certain X users.

ChatGPT has been gradually getting closer to the real-time web, thanks to OpenAI. The AI model was months old when GPT-3.5 was released. OpenAI introduced Browse with Bing, a method of internet browsing for ChatGPT, last September; yet, it seems far less sophisticated than SearchGPT.

OpenAI’s quick progress has brought millions of users to ChatGPT, but the company’s expenses are mounting. According to a story published in The Information this week, OpenAI’s expenses for AI training and inference might total $7 billion this year. Compute costs will also increase due to the millions of people using ChatGPT’s free edition. When SearchGPT first launches, it will be available for free. However, as of right now, it doesn’t seem to have any advertisements, so the company will need to find a way to make money soon.

Technology

Google Revokes its Intentions to stop Accepting Cookies from Marketers

Published

4 days ago

July 23, 2024

Archana Suryawanshi

Following years of delay, Google has announced that it will no longer allow advertisers to remove and replace third-party cookies from its Chrome web browser.

Cookies are text files that websites upload to a user’s browser so they can follow them around when they visit other websites. A large portion of the digital advertising ecosystem has been powered by this practice, which makes it possible to track people across many websites in order to target ads.

Google stated in 2020 that it would stop supporting certain cookies by the beginning of 2022 after determining how to meet the demands of users, publishers, and advertisers and developing solutions to make workarounds easier.

In order to do this, Google started the “Privacy Sandbox” project in an effort to find a way to safeguard user privacy while allowing material to be freely accessible on the public internet.

In January, Google declared that it was “extremely confident” in the advancement of its plans to replace cookies. One such proposal was “Federated Learning of Cohorts,” which would essentially group individuals based on similar browsing habits; thus, only “cohort IDs”—rather than individual user IDs—would be used to target them.

However, Google extended the deadline in June 2021 to allow the digital advertising sector more time to finalize strategies for better targeted ads that respect user privacy. Then, in 2022, the firm stated that feedback had indicated that advertisers required further time to make the switch to Google’s cookie replacement because some had resisted, arguing that it would have a major negative influence on their companies.

The business announced in a blog post on Monday that it has received input from regulators and advertisers, which has influenced its most recent decision to abandon its intention to remove third-party cookies from its browser.

According to the firm, testing revealed that the change would affect publishers, advertisers, and pretty much everyone involved in internet advertising and would require “significant work by many participants.”

Anthony Chavez, vice president of Privacy Sandbox, commented, “Instead of deprecating third-party cookies, we would introduce a new experience in Chrome that lets people make an informed choice that applies across their web browsing, and they’d be able to adjust that choice at any time.” “We’re discussing this new path with regulators and will engage with the industry as we roll it out.”

Technology

Samsung Galaxy Buds 3 Pro Launch Postponed Because of Problems with Quality Control

Published

1 week ago

July 20, 2024

Archana Suryawanshi

At its Unpacked presentation on July 10, Samsung also debuted its newest flagship buds, the Galaxy Buds 3 Pro, with the Galaxy Z Fold 6, Flip 6, and the Galaxy Watch 7. Similar to its other products, the firm immediately began taking preorders for the earphones following the event, and on July 26th, they will go on sale at retail. But the Korean behemoth was forced to postpone the release of the Galaxy Buds 3 Pro and delay preorder delivery due to quality control concerns.

The Galaxy Buds 3 Pro went on sale earlier this week in South Korea, Samsung’s home market, in contrast to the rest of the world. However, allegations of problems with quality control quickly surfaced. These included loose case hinges, earbud joints that did not sit flush, blue dye blotches, scratches or scuffs on the case cover, and so on. It appears that the issues are exclusive to the white Buds 3 Pro; the silver devices are working fine.

Samsung reportedly sent out an email to stop selling Galaxy Buds 3 Pros, according to a Reddit user. These problems appear to be a result of Samsung’s inadequate quality control inspections. Numerous user complaints can also be found on its Korean community forum, where one consumer claims that the firm would enhance quality control and reintroduce the earphones on July 24.

A Samsung official stated. “There have been reports relating to a limited number of early production Galaxy Buds 3 Pro devices. We are taking this matter very seriously and remain committed to meeting the highest quality standards of our products. We are urgently assessing and enhancing our quality control processes.”

“To ensure all products meet our quality standards, we have temporarily suspended deliveries of Galaxy Buds 3 Pro devices to distribution channels to conduct a full quality control evaluation before shipments to consumers take place. We sincerely apologize for any inconvenience this may cause.”

Should Korean customers encounter problems with their Buds 3 Pro devices after they have already received them, they should bring them to the closest service center for a replacement.

Possible postponement of the US debut of the Galaxy Buds 3 Pro

Samsung seems to have rescheduled the launch date and (some) presale deliveries of the Galaxy Buds 3 Pro in the US and other markets by one month. Inspect your earbuds carefully upon delivery to make sure there are no issues with quality control, especially if your order is still scheduled for July.

The Buds 3 Pro is currently scheduled for delivery in late August, one month after its launch date, on the company’s US store. Additionally, Best Buy no longer takes preorders for the earphones, and Amazon no longer lists them for sale.

There are no quality control difficulties affecting the Buds 3, and they are still scheduled for delivery by July 24, the day of launch. Customers of the original Galaxy Buds 3 Pro have reported that taking them out is easy to tear the ear tips. Samsung’s delay, though, doesn’t seem to be related to that issue.