Instruction-following model by Databricks, fine-tuned for diverse language tasks.
Overview:
Dolly v2 (3B) is an instruction-following large language model created by Databricks. Built on EleutherAI's Pythia-2.8b, it has been fine-tuned on a dataset of approximately 15,000 instruction/response pairs to improve the quality of its responses to prompts.
Dolly v2 (3B) is designed for a range of natural language processing tasks, including brainstorming, classification, closed and open question answering, generation, information extraction, and summarization. It is well suited to applications that need reliable instruction following, though it is not a state-of-the-art model.
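For illustration, the model can be driven through the Hugging Face transformers pipeline. The sketch below assumes the weights are published on the Hub as databricks/dolly-v2-3b and that loading the repository's custom text-generation pipeline is acceptable (hence trust_remote_code=True):

import torch
from transformers import pipeline

# Load Dolly v2 (3B); bfloat16 keeps the ~3B parameters small enough for a single modern GPU.
generate_text = pipeline(
    model="databricks/dolly-v2-3b",
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
    device_map="auto",
)

# The custom pipeline wraps the raw prompt in Dolly's instruction template internally.
res = generate_text("Explain the difference between supervised and unsupervised learning.")
print(res[0]["generated_text"])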
Dolly v2 (3B) supports English. Other languages may work, but likely with lower accuracy, since the training data is predominantly English.
Dolly v2 (3B) is based on Pythia-2.8b, a decoder-only Transformer model from EleutherAI's GPT-NeoX family.
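One quick way to confirm the underlying architecture is to inspect the model configuration; this sketch again assumes the Hub id databricks/dolly-v2-3b:

from transformers import AutoConfig

config = AutoConfig.from_pretrained("databricks/dolly-v2-3b")
print(config.model_type)                             # gpt_neox, the architecture family Pythia uses
print(config.num_hidden_layers, config.hidden_size)  # depth and width inherited from Pythia-2.8b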
The model was trained on databricks-dolly-15k, a dataset of approximately 15,000 instruction/response pairs written by Databricks employees. It covers the behavioral categories described in the InstructGPT paper, including brainstorming, classification, question answering, and summarization.
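The dataset is itself publicly available and easy to inspect; the sketch below assumes it is hosted on the Hugging Face Hub as databricks/databricks-dolly-15k:

from datasets import load_dataset

# Each record holds an instruction, an optional context passage, a human-written
# response, and a category label (brainstorming, classification, QA, ...).
ds = load_dataset("databricks/databricks-dolly-15k", split="train")
print(ds.column_names)
print(ds[0]["category"], "->", ds[0]["instruction"])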
Databricks is committed to developing AI technologies that are helpful, honest, and harmless. Even so, the model has known limitations and may produce biased or harmful outputs that reflect biases present in its training data.
Dolly v2 (3B) is released under permissive licensing that allows commercial use: the model weights under MIT, and the databricks-dolly-15k dataset under CC-BY-SA 3.0.