Connect with us

Technology

Quantization of models and the emergence of edge AI

Published

on

Quantization of models and the emergence of edge AI

The amalgamation of edge computing and artificial intelligence holds the potential to revolutionize numerous industries. In this case, the quick development of model quantization—a method that increases portability and decreases model size to enable faster computation—is crucial.

When paired with appropriate methods and tools, edge AI has the potential to completely change how we interact with data and data-driven applications.

Why does AI edge?

Bringing data processing and models closer to the point of data generation—that is, to a remote server, tablet, IoT device, or smartphone—is the goal of edge AI. This makes real-time, low-latency AI possible. By 2025, deep neural networks will analyze more than half of all data at the edge, predicts Gartner. This paradigm change will have several benefits.

Decreased latency:

Edge AI eliminates the need to send data back and forth to the cloud by processing data directly on the device. Applications that need quick responses and rely on real-time data must take this into consideration.

Decreased complexity and costs:

Sending information back and forth doesn’t require costly data transfers when data is processed locally at the edge.

Data stays on the device, minimizing security risks related to data transmission and data leakage. This preserves privacy.

Improved scalability:

Applications can be scaled more easily without depending on a central server for processing power thanks to the decentralized strategy with edge AI.

Manufacturers can integrate edge AI, for instance, into their defect detection, quality control, and predictive maintenance procedures. Manufacturers can better utilize real-time data to decrease downtime and enhance production processes and efficiency by implementing AI and locally analyzing data from smart machines and sensors.

Model quantization’s function

AI models must be optimized for performance without sacrificing accuracy in order for edge AI to be successful. AI models are growing larger, more complex, and more intricate, which makes them more difficult to manage. This makes it difficult to deploy AI models at the edge, since edge devices frequently have low resources and are unable to support these kinds of models.

Model quantization makes the models lighter and more appropriate for deployment on resource-constrained devices like mobile phones, edge devices, and embedded systems by reducing the numerical precision of the model parameters (from 32-bit floating point to 8-bit integer, for example).

Three methods—GPTQ, LoRA, and QLoRA—have surfaced as possible game-changers in the field of model quantization:

Models are compressed as part of GPTQ after training. When deploying models in settings with constrained memory, it works perfectly.

Large pre-trained models must be adjusted for inferencing in LoRA. In particular, it adjusts the smaller matrices (called LoRA adapters) that comprise the large matrix of a model that has already been trained.

Using GPU memory for the pre-trained model makes QLoRA a more memory-efficient choice. When modifying models for new tasks or data sets with limited computational resources, LoRA and QLoRA are particularly helpful.

The particular requirements of the project, whether it is in the deployment or fine-tuning phase, and whether it has the computational resources available all play a significant role in the method selection. Developers can effectively push AI to the limit by utilizing these quantization techniques, striking a balance between efficiency and performance—a crucial aspect for many applications.

Edge platforms and use cases for AI

Edge AI has a wide range of uses. The possibilities are endless: wearable health devices that identify abnormalities in the wearer’s vitals; smart cameras that process images for rail car inspections at train stations; and smart sensors that keep an eye on inventory on store shelves. For this reason, IDC projects that spending on edge computing will amount to $317 billion by 2028. The edge is changing the way businesses handle data.

Strong edge inferencing databases and stacks will become more and more in demand as businesses realize the advantages of AI inferencing at the edge. These platforms offer all the benefits of edge AI, including lower latency and increased data privacy, while also facilitating local data processing.

Continue Reading
Advertisement

Technology

The Debut of Clever.AI was Revealed by CleverTap

Published

on

Clever.AI, the AI engine of CleverTap, one of the top all-in-one platforms for customer engagement and retention, was launched today. Through Clever.AI, CleverTap aims to provide brands with the next generation of AI capabilities needed to develop a human-like understanding of their customers and effectively deliver personalized experiences that increase customer lifetime value.

Brilliant.Predictive, generative, and prescriptive AI are the three main pillars upon which AI is based. Brilliant.These three pillars work together to revolutionize consumer engagement strategies and create more intelligent and effective customer interactions thanks to artificial intelligence (AI).

Clever.AI Gives Brands the Ability to Become:

Perceptive: Equipped with Predictive AI powers, it predicts exact business results, assisting brands in anticipating consumer demands. Astute.The TesseractDBTM, a proprietary technology from CleverTap, powers AI insights by ensuring data granularity over an extended lookback period, improving prediction accuracy, and empowering brands to make well-informed decisions that boost marketing ROI.

Empathetic: Cleverly advancing GenAI.AI creates content that speaks to people on a human level by fusing creativity and emotional intelligence. By using empathy, brands can increase conversion rates and provide hyper-personalized experiences for customers.

Actionable: By utilizing Prescriptive AI capabilities, it helps brands instantly determine the best engagement strategies to maximize conversions throughout the customer journey.

Burger King’s Digital Product Manager, Peter Takacs, gave it a 10 for usability and a wide range of potential applications. “Our marketing campaigns were improved by our ability to quickly and easily experiment with different options before settling on the best one.” It ushers in a new age of ongoing experimentation.

Chief Product Officer and co-founder of CleverTap Anand Jain stated, “We’re excited to introduce Clever.AI is proof of our commitment over the past few years to setting the standard for early adoption of cutting-edge technology to revolutionize customer interaction. CleverTap’s All-in-One engagement platform will continue to be innovated by Clever.As a result of deeper persona profiling and advanced product analytics, AI is improving its predictive precision and strengthening its capacity to recommend intelligent customer experiences. This enables brands to create more successful campaigns that are outcome-driven and highly personalized for each and every customer interaction.

Brands have already seen an increase in conversion with noticeably greater operational efficiency thanks to Clever.AI. They saw a 3x improvement in click-through rates (CTRs), a 36% increase in conversion rates, and a 35% increase in operational efficiency. They also saw an increase in other metrics like purchases and average order values (AOVs). Additionally, by streamlining content creation, experimentation at scale, and campaign roll-outs, Clever.AI improved operational efficiency. Prominent companies like TouchnGo, Swiggy, and Burger King have benefited from the efficiency gains made by Clever.AI in their campaigns.

At its Spring Release ’24 event, which takes place from May 6–9, CleverTap will present its new AI capabilities through a series of stimulating sessions on how AI can improve the intelligence, effectiveness, and engagement of campaigns for brands.

Continue Reading

Technology

Oracle Introduces Database 23ai, Adding Artificial Intelligence to Enterprise Data

Published

on

Oracle has released Oracle Database 23ai, a new database technology that incorporates artificial intelligence. The release, which is now as a suite of cloud services, is concentrated on optimizing application development, supporting crucial workloads, and simplifying the use of AI.

One of its primary features, Oracle AI Vector Search, simplifies data search by letting users look up documents, photos, and relational data using conceptual content rather than precise keywords or data values.

AI Vector Search removes the need to transfer or duplicate data in order to process AI by enabling natural language queries on confidential business information stored in Oracle databases. The integration of AI in real-time with databases improves operational effectiveness, security, and efficiency.

Oracle Database 23ai is accessible via Oracle Cloud Infrastructure (OCI) on Oracle Database@Azure, Oracle Exadata Database Service, Oracle Exadata Cloud@Customer, and Oracle Base Database Service.

Oracle’s Executive Vice President of Mission-Critical Database Technologies, Juan Loaiza, emphasized the importance of Oracle Database 23ai and called it a revolutionary tool for multinational corporations.

“Building intelligent apps, increasing developer productivity, and managing mission-critical workloads is made simple for developers and data professionals by AI Vector Search in conjunction with new unified development paradigms and mission-critical capabilities,” the speaker stated.

Three major improvements have been made to Oracle Database 23ai: OCI GoldenGate 23ai for real-time data replication across heterogeneous stores, AI Vector Search for semantic search, and Oracle Exadata System Software 24ai for accelerated AI processing. By utilizing JSON and graph data models, mission-critical data security, and availability are guaranteed, and developers are empowered to create intelligent apps.

Customers may anticipate higher data security, more rapid enterprise application innovation, and increased operational efficiency with Oracle’s ongoing developments in AI-integrated databases. A strong foundation for companies embracing AI technologies is promised by Oracle Database 23ai, which marks a substantial advancement in AI-driven database systems.

Continue Reading

Technology

Google Introduces Gemini AI on Android Devices for Singapore Users

Published

on

Singapore is among the main beneficiaries of Google’s Gemini Mobile App, which enhances the AI capabilities of Android-based smartphones. With Gemini AI now supporting more languages and regions, this rollout is a part of Google’s larger strategy to make its advanced AI available to a global audience.

The Gemini app is now available for direct download or Google Assistant access for Android users in Singapore. The app works with Android phones running Android 12 or later and having at least 4 GB of RAM. On iOS devices running iOS 16 or later, users can interact with Gemini through a dedicated tab in the Google app.

With Gemini AI’s flexible and intuitive design, users can get help by speaking, typing, or uploading an image. To illustrate Google’s goal of developing a truly conversational and multimodal AI assistant, you could, for example, take a picture of a flat tire and receive detailed instructions on how to fix it, or ask for assistance writing a thank-you note.

Google is incorporating Gemini more thoroughly into its ecosystem in addition to the stand-alone app. With the help of new extensions, the AI can now effortlessly search through a wide range of Google services, including YouTube, Gmail, Docs, Drive, Maps, and even Google Flights and Hotels, to offer thorough support. Gemini’s ability to combine travel dates, lodging, and activities into a single itinerary based on user emails and preferences makes it an especially helpful tool for complicated tasks like organizing travel plans.

Additionally, Google is making using Gemini on desktops easier. By typing “@gemini” after their question, users can start direct inquiries from the address bar of the Chrome browser. This results in a rapid launch of the gemini.google.com page, which further integrates Gemini’s AI capabilities across platforms and shows answers right away.

Google’s latest developments improve the daily digital experience for users in Singapore and possibly globally, while also advocating for increased accessibility to AI tools.

Continue Reading

Trending

error: Content is protected !!