Connect with us

Technology

MM1, a Family of Multimodal AI Models with up to 30 billion Parameters, is being Developed by Apple Researchers

Published

on

In a pre-print paper, Apple researchers presented their work on developing a multimodal large language model (LLM) for artificial intelligence (AI). The paper describes how it was possible to achieve the advanced capabilities of multimodality and train the foundation model on both text-only data and images, and it was published on an online portal on March 14. The Cupertino-based tech giant has made new advances in AI in response to CEO Tim Cook’s statement during the company’s earnings calls, which stated that AI features might be released later this year.

ArXiv, an open-access online repository for scholarly papers, has published the research paper’s pre-print version. Peer review is not, however, applied to the papers that are posted here. The project is thought to be connected to Apple as well, even though the paper makes no mention of the company; this is because the majority of the researchers mentioned are connected to the machine learning (ML) division of Apple.

A family of multimodal models with up to 30 billion parameters, known as MM1, is the project that the researchers are currently working on. The paper’s authors referred to it as a “performant multimodal LLM (MLLM)” and noted that in order to build an AI model that can comprehend both text and image-based inputs, image encoders, the vision language connector, and other architecture elements and data decisions were made.

The paper provided an example in stating that “We demonstrate that achieving state-of-the-art (SOTA) few-shot results across multiple benchmarks, compared to other published pre-training results, requires a careful mix of image-caption, interleaved image-text, and text-only data for large-scale multimodal pre-training.”

To put it simply, the AI model has not received enough training to produce the intended results and is presently in the pre-training phase. This phase involves designing the model’s workflow and data processing eventually using the algorithm and AI architecture. The researchers at Apple were able to incorporate computer vision into the model by means of a vision language connector and image encoders. Upon conducting tests using a combination of image-only, image-text, and text-only data sets, the team discovered that the outcomes were comparable to those of other models at the same stage.

Although this is a significant breakthrough, there is insufficient evidence in this research paper to conclude that Apple will integrate a multimodal AI chatbot into its operating system. It’s difficult to even say at this point whether the AI model is multimodal in terms of receiving inputs or producing output (i.e., whether it can produce AI images or not). However, it can be said that the tech giant has made significant progress toward developing a native generative AI foundation model if the results are verified to be consistent following peer review.

Technology

Windows 11 PCs with Arm Processors now have an Official ISO for Clean Installations

Published

on

Power users occasionally prefer to start over when they acquire a new computer, so they follow the pro-gamers’ advice and reinstall Windows using a brand-new ISO image that comes straight from Microsoft and is free of bloatware and needlessly complex “driver management programs.” Up until recently, the new Snapdragon laptops’ more specialized version of Windows 11 didn’t support that.

The Windows 11 build on these new laptops is unusual because of the Arm64-based hardware, which differs from the typical x86 and x64 innards found in most laptops and desktops. Microsoft has finally released a disk image (or ISO file) for these devices after several months of waiting. To perform a direct reinstallation or make a bootable flash drive for a different device, you may now download it straight from Microsoft’s website. It is identical to the installation media utility that is currently available.

Be aware that there may be some glitches if you use this method for a fresh install. Compared to previous designs, the Snapdragon X system-on-a-chip has a lot fewer hardware variables, but because it’s so new, Windows Update might not include all the necessary components. You may need to use an Ethernet connection or the old-fashioned sneakernet to manually load drivers from another computer. You may also need to do some Googling to locate all the files you require for that.

Continue Reading

Technology

OPPO Reno 13 series will debut in China shortly, with India following in 2025

Published

on

According to reports, OPPO, a Chinese firm, is getting ready to introduce its Reno 13 series smartphones in its native nation this month. As per 91Mobiles, the OPPO Reno 13 and Reno 13 Pro models are anticipated to debut in China on November 25. The Indian launch is probably set for January 2025. The smartphone series that debuted in July of this year, the Reno 12 series, will be replaced by the Reno 13 series.

Information regarding the specifications of the new Reno 13 and Reno 13 Pro smartphones has leaked online, although the business has not yet confirmed the launch date. These are the specifics:

OPPO Reno 13 Series: Anticipations

It is anticipated that the OPPO Reno 13 Pro would have a 6.78-inch, quad-curved OLED screen with 1.5K resolution. In contrast, the slightly smaller 6.7-inch display with FHD+ resolution is found on the OPPO Reno 12 Pro. In China, the Pro model is probably going to be powered by the MediaTek Dimensity 8350 chipset, while in India, it might have a different processor. A 50MP primary camera, an 8MP ultrawide sensor, and a 50MP telephoto sensor with 3x optical zoom are anticipated to be included in the OPPO Reno 13 Pro’s photographic setup. Most likely, the front camera will include a 50MP sensor.

With a 5,900mAh battery as opposed to the 5,000mAh battery on the Reno 12 Pro, the Reno 13 Pro is anticipated to significantly increase battery capacity. Additionally, it is anticipated that the smartphone would support both 50W wireless and 80W wired charging. Additionally, an IP68/IP69 designation for water and dust protection could increase its durability.

Although the price of the smartphones in the Reno 13 series is not well known, it is anticipated to be similar to that of its predecessor. For comparison, the 12GB RAM + 256GB storage version of the OPPO Reno 12 Pro launched at Rs 36,999, while the 8GB RAM + 256GB storage version of the vanilla model cost Rs 32,999.

OPPO Reno 13 Pro: Anticipated features

  • Display: 6.78-inch OLED, quad-curved, with a refresh rate of 120 Hz and a resolution of 1.5K
  • processor: MediaTek Dimensity 8350
  • rear camera: 50MP primary, 8MP ultra-wide, and 50MP telephoto (3x zoom)
  • front camera: 50MP
  • Battery: 5,900mAh
  •  Charging: 50W wireless and 80W wired
  • IP rating: IP68/IP69; operating system: ColorOS 15 based on Android 15

Continue Reading

Technology

Apple has released Final Cut Pro 11, an AI-powered program

Published

on

Apple introduced Final Cut X thirteen years ago. Considering that the video-editing program marked its 25th birthday this April, that represents just over half of its lifetime. Some have questioned whether the corporation has discreetly withdrawn the offering due to its multiple lifetimes in the consumer software industry.

Final Cut Pro finally reaches level 11, after 13 years of waiting, and Apple is no longer playing around. On Wednesday, the program will be accessible for download. After a 90-day trial period, new users will need to pay $300 to buy Final Cut Pro 11 from the Mac App Store, while current users will receive it as a free update.

What specifically justified the much anticipated move to 11? AI is two letters. The business is using AI to power new features just weeks after releasing Apple Intelligence for iOS, iPadOS, and MacOS.

Magnetic Mask is at the top of the list because it makes it simple to crop objects and people out of videos without using a green screen.

According to Apple, “This powerful and precise automatic analysis provides additional flexibility to customize backgrounds and environments,” “Editors can also combine Magnetic Mask with color correction and video effects, allowing them to precisely control and stylize each project.”

Transcribe to Captions, which basically adds text to Final Cut’s timeline, is the second standout AI-based tool here. The company claims that its in-house large language model (LLM) powers that feature.

Apple’s problematic mixed-reality headset is the subject of this article’s other major headline. The most recent iPhones now have the capability to record Spatial Video, and Final Cut may be used to edit that footage. It is possible to add effects, color correct the video, and change the titles’ depth placement.

Apple is reportedly working on a more inexpensive variant, even though CEO Tim Cook has acknowledged that the $3,500 headgear isn’t the mainstream consumer product the company wanted. Along with the iPhone 15 Pro and all iPhone 16 models, the Vision Pro itself can record spatial video. Additionally, Canon just unveiled a new twin lens that works with R7 cameras.

Additionally, there are various time-saving features in the new Final Cut. For example, Magnetic Timeline allows you to swiftly rearrange clips while maintaining audio and video synchronization.

According to Apple, Final Cut Pro 11 was developed especially for the M-series of CPUs, which are its first-party silicon. This includes having more simultaneous 4K and 8K playback capabilities.

Apple claims that the M-series of chips, their first-party silicon, were the reason behind the creation of Final Cut Pro 11. This includes the capacity to play back several 4K and 8K ProRes video streams at once.

Final Cut Pro for iPad 2.1 is being released by Apple concurrently with the eagerly anticipated release of Pro 11. The brightness and color of the touched-based interface will be increased, and the workflow will be enhanced as well. Starting on Wednesday, current users can also obtain that for free.

Continue Reading

Trending

error: Content is protected !!