Assisted Fine-Tuning
At DevDay last November, we announced a Custom Model program designed to train and optimize models for a specific domain, in partnership with a dedicated group of OpenAI researchers. Since then, we've met with dozens of customers to assess their custom model needs and evolved our program to further maximize performance.
Today, we're formally announcing our assisted fine-tuning offering as part of the Custom Model program. Assisted fine-tuning is a collaborative effort with our technical teams to leverage techniques beyond the fine-tuning API, such as additional hyperparameters and various parameter-efficient fine-tuning (PEFT) methods at a larger scale. It's particularly helpful for organizations that need support setting up efficient training data pipelines, evaluation systems, and bespoke parameters and methods to maximize model performance for their use case or task.
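For context, the public fine-tuning API exposes only a small set of hyperparameters; the sketch below shows that baseline using the openai Python client, with a placeholder training file ID. Assisted fine-tuning works beyond this surface, so this is purely illustrative of the starting point.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Create a fine-tuning job with explicit hyperparameters.
# "file-abc123" is a placeholder ID for a previously uploaded
# JSONL training file.
job = client.fine_tuning.jobs.create(
    model="gpt-3.5-turbo",
    training_file="file-abc123",
    hyperparameters={
        "n_epochs": 3,                    # passes over the training set
        "batch_size": 8,                  # examples per gradient update
        "learning_rate_multiplier": 0.1,  # scales the default learning rate
    },
)
print(job.id, job.status)
```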
For example, SK Telecom, a telecommunications operator serving over 30 million subscribers in South Korea, wanted to customize a model to be an expert in the telecommunications domain with an initial focus on customer service. They worked with OpenAI to fine-tune GPT-4 to improve its performance in telecom-related conversations in the Korean language. Over the course of multiple weeks, SKT and OpenAI drove meaningful performance improvements in telecom customer service tasks: a 35% increase in conversation summarization quality, a 33% increase in intent recognition accuracy, and an increase in satisfaction scores from 3.6 to 4.5 (out of 5) when comparing the fine-tuned model to GPT-4.
Custom-Trained Model
In some cases, organizations need to train a purpose-built model from scratch that understands their business, industry, or domain. Fully custom-trained models imbue new knowledge from a specific domain by modifying key steps of the model training process using novel mid-training and post-training techniques. Organizations that see success with a fully custom-trained model often have large quantities of proprietary data (millions of examples or billions of tokens) that they want to use to teach the model new knowledge or complex, unique behaviors for highly specific use cases.
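As a rough way to gauge whether a corpus approaches that billions-of-tokens scale, token counts can be estimated with an open-source tokenizer such as tiktoken. A minimal sketch, where the document list is a placeholder for your own data:

```python
import tiktoken

# Back-of-the-envelope corpus sizing: count tokens across a set of
# documents. cl100k_base is the tokenizer used by GPT-4-era models.
enc = tiktoken.get_encoding("cl100k_base")

def corpus_token_count(documents: list[str]) -> int:
    return sum(len(enc.encode(doc)) for doc in documents)

docs = ["First proprietary document ...", "Second proprietary document ..."]
print(f"{corpus_token_count(docs):,} tokens")
```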
For example, Harvey, an AI-native legal tool for attorneys, partnered with OpenAI to create a custom-trained large language model for case law. While foundation models were strong at reasoning, they lacked the extensive knowledge of legal case history and other knowledge required for legal work. After testing out prompt engineering, RAG, and fine-tuning, Harvey worked with our team to add the depth of context needed to the model: the equivalent of 10 billion tokens' worth of data. Our team modified every step of the model training process, from domain-specific mid-training to customizing post-training processes and incorporating expert attorney feedback. The resulting model achieved an 83% increase in factual responses, and lawyers preferred the customized model's outputs 97% of the time over GPT-4.