TECHNOLOGY

H2O AI releases Danube, a giant-minute LLM for mobile applications

Robot sailing in Danube river

Robotic crusing in Danube river

Image Credit: Venturebeat made with Ideogram

This day, H2O AI, the corporate working to democratize AI with a unfold of originate-supply and proprietary tools, announced the open of Danube, a brand new giant-minute giant language mannequin (LLM) for mobile devices.

Named after the 2nd-finest river in Europe, the originate-supply mannequin comes with 1.8 billion parameters and is asserted to match or outperform equally sized devices across a unfold of pure language tasks. This puts it within the identical category as solid offerings from Microsoft, Steadiness AI and Eleuther AI.

The timing of the announcement makes finest sense. Enterprises building consumer devices are racing to net the chance of offline generative AI, where devices flee within the neighborhood on the product, giving customers like a flash support across capabilities and laying aside the hang to preserve recordsdata out to the cloud.

“We’re enraged to open H2O-Danube-1.8B as a portable LLM on tiny devices luxuriate in your smartphone… The proliferation of smaller, decrease-payment hardware and more efficient practicing now lets in modestly-sized devices to be accessible to a noteworthy broader target audience… We imagine H2O-Danube-1.8B will seemingly be a sport changer for mobile offline applications,” Sri Ambati, CEO and co-founding father of H2O, said in a assertion.

VB Occasion

The AI Affect Tour – NYC

We’ll be in Fresh York on February 29 in partnership with Microsoft to focus on how it is probably you’ll well well also balance dangers and rewards of AI applications. Query an invite to the odd match below.


Query an invite

What to anticipate from Danube-1.8B LLM?

While Danube has factual been announced, H2O claims it is going to even be comely-tuned to accommodate a unfold of pure language applications on tiny devices, together with typical sense reasoning, discovering out comprehension, summarization and translation. 

To prepare the mini mannequin, the corporate quiet a trillion tokens from diverse web sources and utilized ways refined from Llama 2 and Mistral devices to give a rob to its era capabilities.

“We adjusted the Llama 2 structure for a total of around 1.8B parameters. We (then) worn the customary Llama 2 tokenizer with a vocabulary dimension of 32,000 and educated our mannequin as much as a context length of 16,384. We incorporated the sliding window attention from Mistral with a dimension of 4,096,” the corporate infamous while describing the mannequin structure on Hugging Face.

When tested on benchmarks, the mannequin used to be stumbled on to be performing on par or better than most devices within the 1-2B-parameter category. 

As an illustration, within the Hellaswag test aimed at evaluating typical sense pure language inference, it performed with an accuracy of 69.58%, sitting factual within the encourage of Steadiness AI’s Stable LM 2 1.6 billion parameter mannequin pre-educated on 2 trillion tokens. Equally, within the Arc benchmark for evolved ask answering, it ranks third within the encourage of Microsoft Phi 1.5 (1.3-billion parameter mannequin) and Stable LM 2 with an accuracy of 39.42%.

H2O has launched Danube-1.8B below an Apache 2.0 license for commercial exercise. Any team taking a ogle to put into effect the mannequin for a mobile exercise case can download it from Hugging Face and originate utility-explicit comely-tuning. 

To manufacture this course of more uncomplicated, the corporate also plans to open additional tooling soon. It has also launched a chat-tuned version of the mannequin (H2O-Danube-1.8B-Chat), which will seemingly be utilized for conversational applications.

In the future, the provision of Danube and the same tiny-sized devices is anticipated to power a surge in offline generative AI applications across phones and laptops, serving to with tasks luxuriate in electronic mail summarization, typing and image editing. Truly, Samsung has already moved on this route with the open of its S24 line of smartphones.

VentureBeat’s mission is to be a digital town sq. for technical decision-makers to execute recordsdata about transformative project expertise and transact. Thought our Briefings.

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top button