Startec

Startec

Meet BLOOMChat: An Open-Source 176-Billion-Parameter Multilingual Chat Large Language Model (LLM) Built on Top of the BLOOM Model

Hoje, às 01:46

·

4 min de leitura

·

0 leituras

With some great advancements being made in the field of Artificial Intelligence, natural language systems are rapidly progressing. Large Language Models (LLMs) are getting significantly better and more...
Meet BLOOMChat: An Open-Source 176-Billion-Parameter Multilingual Chat Large Language Model (LLM) Built on Top of the BLOOM Model

With some great advancements being made in the field of Artificial Intelligence, natural language systems are rapidly progressing. Large Language Models (LLMs) are getting significantly better and more popular with each upgrade and innovation. A new feature or modification is being added nearly daily, enabling LLMs to serve in different applications in almost every domain. LLMs are everywhere, from Machine translation and text summarization to sentiment analysis and question answering.

The open-source community has made some remarkable progress in developing chat-based LLMs, but mostly in the English language. A little less focus has been put on developing kind of similar multilingual chat capability in an LLM. To address that, SambaNova, a software company that focuses on generative AI solutions, has introduced an open-source, multilingual chat LLM called BLOOMChat. Developed in collaboration with Together, which is an open, scalable, and decentralized cloud for Artificial Intelligence, BLOOMChat is a 176-billion-parameter multilingual chat LLM built on top of the BLOOM model.

The BLOOM model has the ability to generate text in 46 natural languages and 13 programming languages. For languages such as Spanish, French, and Arabic, BLOOM represents the first language model ever created with over 100 billion parameters. BLOOM was developed by the BigScience organization, which is an international collaboration of over 1000 researchers. By fine-tuning BLOOM on open conversation and alignment datasets from projects like OpenChatKit, Dolly 2.0, and OASST1, the core capabilities of BLOOM were extended into the chat domain.

For the development of the multilingual chat LLM, BLOOMChat, SambaNova, and Together have used the SambaNova DataScale systems that utilize SambaNova’s unique Reconfigurable Dataflow Architecture for the training process. Synthetic conversation data and human-written samples have been combined to create BLOOMChat. A big synthetic dataset called OpenChatKit has served as the basis for chat functionality, and higher-quality human-generated datasets like Dolly 2.0 and OASST1 have been used to enhance performance significantly. The code and scripts used for instruction-tuning on the OpenChatKit and Dolly-v2 datasets have been made available on SambaNova’s GitHub.

In human evaluations conducted across six languages, BLOOMChat responses were preferred over GPT-4 responses 45.25% of the time. Compared to four other open-source chat-aligned models in the same six languages, BLOOMChat’s responses ranked as the best 65.92% of the time. This accomplishment successfully closes the open-source market’s multilingual chat capability gap. In the WMT translation test, BLOOMChat performed better than additional BLOOM model iterations as well as popular open-source conversation models.

BLOOMChat, like other chat LLMs, has limitations. It may produce factually incorrect or irrelevant information or may switch languages by mistake. It can even repeat phrases, have limited coding or math capabilities, and sometimes generate toxic content. Further research is working towards addressing these challenges and ensuring better usage.

In conclusion, BLOOMChat builds upon the extensive work of the open-source community and is a great addition to the list of some highly useful and multilingual LLMs. By releasing it under an open-source license, SambaNova and Together aims to expand access to advanced multilingual chat capabilities and encourage further innovation in the AI research community.


Check out the Project and Reference Article. Don’t forget to join our 21k+ ML SubRedditDiscord Channel, and Email Newsletter, where we share the latest AI research news, cool AI projects, and more. If you have any questions regarding the above article or if we missed anything, feel free to email us at [email protected]

🚀 Check Out 100’s AI Tools in AI Tools Club

Tanya Malhotra

Tanya Malhotra is a final year undergrad from the University of Petroleum & Energy Studies, Dehradun, pursuing BTech in Computer Science Engineering with a specialization in Artificial Intelligence and Machine Learning.
She is a Data Science enthusiast with good analytical and critical thinking, along with an ardent interest in acquiring new skills, leading groups, and managing work in an organized manner.


Continue lendo

DEV

Day 1 - Menjadi Android Developer
Hari ini merupakan awal saya mempelajari android, dan melalui platform ini saya akan menuliskan perjalanan saya selama mempelajari android. Hari ini saya belajar melalui web developer.android.com mulai dari...

Hoje, às 15:42

AI | Techcrunch

Microsoft's Azure AI Studio lets developers build their own AI 'copilots'
Microsoft wants companies to build their own AI-powered “copilots” — using tools on Azure and machine learning models from its close partner OpenAI, of course. Today at its annual Build conference, Microsoft...

Hoje, às 15:00

AI | Techcrunch

Microsoft pledges to watermark AI-generated images and videos
Balenciaga Pope. Fake Pentagon explosions. It’s becoming increasingly difficult to tell AI-generated images apart from the real thing, sometimes to disastrous effect. A solution remains elusive. But...

Hoje, às 15:00

DEV

Agile Certifications: Boost Your Career with Agile
In today's fast-paced world, Agile certifications have become a must-have for professionals in the software development industry. This article will guide you through the various Agile certifications and how...

Hoje, às 14:48

HackerNoon

Voyage on Goat Island | HackerNoon
Start WritingNotificationssee moreVoyage on Goat Island [email protected] 23rd 2023 New Story by @hgwells Too Long; Didn't ReadThe War in the Air, by H. G. Wells, is part of the HackerNoon Books Series. Enjoy...

Hoje, às 14:25

HackerNoon

REST API Design Mistakes | HackerNoon
May 23rd 2023 New Story by @KonstantinGlumov Too Long; Didn't ReadREST is the de facto standard in the industry for many years. It was invented by Roy Fielding, who was also one of the creators of the HTTP...

Hoje, às 14:09

HackerNoon

67 Stories To Learn About Node | HackerNoon
May 23rd 2023 New Story11m by @learn Too Long; Didn't ReadPeople Mentionedprogramming#node#learn#learn-node#[email protected] LearnReceive Stories from @learnGet free API security automated scan in...

Hoje, às 13:56

HackerNoon

51 Stories To Learn About No Code Platform | HackerNoon
May 23rd 2023 New Story9m by @learn Too Long; Didn't Readprogramming#no-code-platform#learn#[email protected] LearnReceive Stories from @learnGet free API security automated scan in minutesRELATED...

Hoje, às 13:56

HackerNoon

92 Stories To Learn About Nocode | HackerNoon
May 23rd 2023 New Story16m by @learn Too Long; Didn't ReadPeople Mentionedscience#nocode#learn#learn-nocode#[email protected] LearnReceive Stories from @learnGet free API security automated scan in...

Hoje, às 13:56

TabNews

[DÚVIDA] Topam me ajudar em um projeto top? · 7Cheater8
Olá, pessoal, como vocês estão? Queria saber quantos de vocês são da área de cyber segurança ou tem interesse, e se topam me ajudar numa coisa, tenho uma página no notion com várias anota...

Hoje, às 13:54