Connect with us

Technology

Quantization of models and the emergence of edge AI

Published

on

Quantization of models and the emergence of edge AI

The amalgamation of edge computing and artificial intelligence holds the potential to revolutionize numerous industries. In this case, the quick development of model quantization—a method that increases portability and decreases model size to enable faster computation—is crucial.

When paired with appropriate methods and tools, edge AI has the potential to completely change how we interact with data and data-driven applications.

Why does AI edge?

Bringing data processing and models closer to the point of data generation—that is, to a remote server, tablet, IoT device, or smartphone—is the goal of edge AI. This makes real-time, low-latency AI possible. By 2025, deep neural networks will analyze more than half of all data at the edge, predicts Gartner. This paradigm change will have several benefits.

Decreased latency:

Edge AI eliminates the need to send data back and forth to the cloud by processing data directly on the device. Applications that need quick responses and rely on real-time data must take this into consideration.

Decreased complexity and costs:

Sending information back and forth doesn’t require costly data transfers when data is processed locally at the edge.

Data stays on the device, minimizing security risks related to data transmission and data leakage. This preserves privacy.

Improved scalability:

Applications can be scaled more easily without depending on a central server for processing power thanks to the decentralized strategy with edge AI.

Manufacturers can integrate edge AI, for instance, into their defect detection, quality control, and predictive maintenance procedures. Manufacturers can better utilize real-time data to decrease downtime and enhance production processes and efficiency by implementing AI and locally analyzing data from smart machines and sensors.

Model quantization’s function

AI models must be optimized for performance without sacrificing accuracy in order for edge AI to be successful. AI models are growing larger, more complex, and more intricate, which makes them more difficult to manage. This makes it difficult to deploy AI models at the edge, since edge devices frequently have low resources and are unable to support these kinds of models.

Model quantization makes the models lighter and more appropriate for deployment on resource-constrained devices like mobile phones, edge devices, and embedded systems by reducing the numerical precision of the model parameters (from 32-bit floating point to 8-bit integer, for example).

Three methods—GPTQ, LoRA, and QLoRA—have surfaced as possible game-changers in the field of model quantization:

Models are compressed as part of GPTQ after training. When deploying models in settings with constrained memory, it works perfectly.

Large pre-trained models must be adjusted for inferencing in LoRA. In particular, it adjusts the smaller matrices (called LoRA adapters) that comprise the large matrix of a model that has already been trained.

Using GPU memory for the pre-trained model makes QLoRA a more memory-efficient choice. When modifying models for new tasks or data sets with limited computational resources, LoRA and QLoRA are particularly helpful.

The particular requirements of the project, whether it is in the deployment or fine-tuning phase, and whether it has the computational resources available all play a significant role in the method selection. Developers can effectively push AI to the limit by utilizing these quantization techniques, striking a balance between efficiency and performance—a crucial aspect for many applications.

Edge platforms and use cases for AI

Edge AI has a wide range of uses. The possibilities are endless: wearable health devices that identify abnormalities in the wearer’s vitals; smart cameras that process images for rail car inspections at train stations; and smart sensors that keep an eye on inventory on store shelves. For this reason, IDC projects that spending on edge computing will amount to $317 billion by 2028. The edge is changing the way businesses handle data.

Strong edge inferencing databases and stacks will become more and more in demand as businesses realize the advantages of AI inferencing at the edge. These platforms offer all the benefits of edge AI, including lower latency and increased data privacy, while also facilitating local data processing.

Continue Reading
Advertisement

Technology

Threads uses a more sophisticated search to compete with Bluesky

Published

on

Instagram Threads, a rival to Meta’s X, will have an enhanced search experience, the firm said Monday. The app, which is based on Instagram’s social graph and provides a Meta-run substitute for Elon Musk’s X, is introducing a new feature that lets users search for certain posts by date ranges and user profiles.

Compared to X’s advanced search, which now allows users to refine queries by language, keywords, exact phrases, excluded terms, hashtags, and more, this is less thorough. However, it does make it simpler for users of Threads to find particular messages. Additionally, it will make Threads’ search more comparable to Bluesky’s, which also lets users use sophisticated queries to restrict searches by user profiles, date ranges, and other criteria. However, not all of the filtering options are yet visible in the Bluesky app’s user interface.

In order to counter the danger posed by social networking startup Bluesky, which has quickly gained traction as another X competitor, Meta has started launching new features in quick succession in recent days. Bluesky had more than 9 million users in September, but in the weeks after the U.S. elections, users left X due to Elon Musk’s political views and other policy changes, including plans to alter the way blocks operate and let AI companies train on X user data. According to Bluesky, there are currently around 24 million users.

Meta’s Threads introduced new features to counter Bluesky’s potential, such as an improved algorithm, a design modification that makes switching between feeds easier, and the option for users to select their own default feed. Additionally, it was observed creating Starter Packs, its own version of Bluesky’s user-curated recommendation lists.

Continue Reading

Technology

Apple’s own 5G modem-equipped iPhone SE 4 is “confirmed” to launch in March

Published

on

Tom O’Malley, an analyst at Barclays, recently visited Asia with his colleagues to speak with suppliers and makers of electronics. The analysts said they had “confirmed” that a fourth-generation iPhone SE with an Apple-designed 5G modem is scheduled to launch near the end of the first quarter next year in a research note they released this week that outlines the main conclusions from the trip. That timeline implies that the next iPhone SE will be unveiled in March, similar to when the present model was unveiled in 2022, in keeping with earlier rumors.

The rumored features of the fourth-generation iPhone SE include a 6.1-inch OLED display, Face ID, a newer A-series chip, a USB-C port, a single 48-megapixel rear camera, 8GB of RAM to enable Apple Intelligence support, and the previously mentioned Apple-designed 5G modem. The SE is anticipated to have a similar design to the base iPhone 14.

Since 2018, Apple is said to have been developing its own 5G modem for iPhones, a move that will let it lessen and eventually do away with its reliance on Qualcomm. With Qualcomm’s 5G modem supply arrangement for iPhone launches extended through 2026 earlier this year, Apple still has plenty of time to finish switching to its own modem. In addition to the fourth-generation iPhone SE, Apple analyst Ming-Chi Kuo earlier stated that the so-called “iPhone 17 Air” would come with a 5G modem that was created by Apple.

Whether Apple’s initial 5G modem would offer any advantages to consumers over Qualcomm’s modems, such quicker speeds, is uncertain.

Qualcomm was sued by Apple in 2017 for anticompetitive behavior and $1 billion in unpaid royalties. In 2019, Apple purchased the majority of Intel’s smartphone modem business after the two firms reached a settlement in the dispute. Apple was able to support its development by acquiring a portfolio of patents relating to cellular technology. It appears that we will eventually be able to enjoy the results of our effort in four more months.

On March 8, 2022, Apple made the announcement of the third-generation iPhone SE online. With antiquated features like a Touch ID button, a Lightning port, and large bezels surrounding the screen, the handset resembles the iPhone 8. The iPhone SE presently retails for $429 in the United States, but the new model may see a price increase of at least a little.

Continue Reading

Technology

Google is said to be discontinuing the Pixel Tablet 2 and may be leaving the market once more

Published

on

Google terminated the development of the Pixel Tablet 3 yesterday, according to Android Headlines, even before a second-generation model was announced. The second-generation Pixel Tablet has actually been canceled, according to the report. This means that the gadget that was released last year will likely be a one-off, and Google is abandoning the tablet market for the second time in just over five years.

If accurate, the report indicates that Google has determined that it is not worth investing more money in a follow-up because of the dismal sales of the Pixel Tablet. Rumors of a keyboard accessory and more functionality for the now-defunct project surfaced as recently as last week.

It’s important to keep in mind that Google’s Nest subsidiary may abandon its plans for large-screen products in favor of developing technologies like the Nest Hub and Hub Max rather than standalone tablets.

Google has always had difficulty making a significant impact in the tablet market and creating a competitor that can match Apple’s iPad in terms of sales and general performance, not helped in the least by its inconsistent approach. Even though the hardware was good, it never really fought back after getting off to a promising start with the Nexus 7 eons ago. Another problem that has hampered Google’s efforts is that Android significantly trails iPadOS in terms of the quantity of third-party apps that are tablet-optimized.

After the Pixel Slate received tremendously unfavorable reviews, the firm first declared that it was finished producing tablets in 2019. Two tablets that were still in development at the time were discarded.

By 2022, however, Google had altered its mind and declared that a tablet was being developed by its Pixel hardware team. The $499 Pixel Tablet was the final version of the gadget, which came with a speaker dock that the tablet could magnetically connect to. (Google would subsequently charge $399 for the tablet alone.)

Continue Reading

Trending

error: Content is protected !!