Pre-configured Models

Overview

Jan ships with a set of pre-configured AI models, each with different capabilities. The table below describes each one.

| Model | Description |
| --- | --- |
| Mistral Instruct 7B Q4 | A model trained on extensive internet data for broad, general-purpose understanding |
| OpenHermes Neural 7B Q4 | A merged model built with the TIES method that performs well across various benchmarks |
| Stealth 7B Q4 | A new experimental model family designed to enhance mathematical and logical abilities |
| Trinity-v1.2 7B Q4 | An experimental model merge using the Slerp method |
| Openchat-3.5 7B Q4 | An open-source model that outperforms ChatGPT-3.5 and Grok-1 across various benchmarks |
| Wizard Coder Python 13B Q5 | A Python coding model that demonstrates high proficiency in coding and mathematics |
| OpenAI GPT 3.5 Turbo | The latest GPT-3.5 Turbo model, with higher accuracy at responding in requested formats and a fix for a bug that caused a text encoding issue in non-English function calls |
| OpenAI GPT 3.5 Turbo 16k 0613 | A snapshot of gpt-3.5-turbo-16k from June 13, 2023 |
| OpenAI GPT 4 | The latest GPT-4 model, intended to reduce cases of "laziness" where the model doesn't complete a task |
| TinyLlama Chat 1.1B Q4 | A tiny model with only 1.1B parameters, well suited to less powerful computers |
| Deepseek Coder 1.3B Q8 | A model that excels at project-level code completion, with advanced capabilities across multiple programming languages |
| Phi-2 3B Q8 | A 2.7B-parameter model that excels in common-sense and logical reasoning benchmarks, trained on synthetic texts and filtered websites |
| Llama 2 Chat 7B Q4 | A model trained on extensive internet data for broad, general-purpose understanding |
| CodeNinja 7B Q4 | A model that is good at coding tasks and can handle various languages, including Python, C, C++, Rust, Java, JavaScript, and more |
| Noromaid 7B Q5 | A model designed for role-playing with human-like behavior |
| Starling alpha 7B Q4 | An upgrade of Openchat 3.5 trained with RLAIF that performs well on various benchmarks, especially those judged by GPT-4 |
| Yarn Mistral 7B Q4 | A long-context language model that supports a 128k-token context window |
| LlaVa 1.5 7B Q5 K | A model that brings vision understanding to Jan |
| BakLlava 1 | A model that brings vision understanding to Jan |
| Solar Slerp 10.7B Q4 | A Slerp merge of SOLAR Instruct and Pandora-v1 |
| LlaVa 1.5 13B Q5 K | A model that brings vision understanding to Jan |
| Deepseek Coder 33B Q5 | A model that excels at project-level code completion, with advanced capabilities across multiple programming languages |
| Phind 34B Q5 | A multi-lingual model fine-tuned on 1.5B tokens of high-quality programming data that excels in various programming languages and is designed to be steerable and user-friendly |
| Yi 34B Q5 | A specialized chat model known for its diverse and creative responses that excels across various NLP tasks and benchmarks |
| Capybara 200k 34B Q5 | A long-context model that supports 200K tokens |
| Dolphin 8x7B Q4 | An uncensored model built on Mixtral-8x7b that is good at programming tasks |
| Mixtral 8x7B Instruct Q4 | A pre-trained generative Sparse Mixture of Experts model that outperforms 70B models on most benchmarks |
| Tulu 2 70B Q4 | A strong alternative to Llama 2 70b Chat for use as a helpful assistant |
| Llama 2 Chat 70B Q4 | A model trained on extensive internet data for broad, general-purpose understanding |
note

OpenAI GPT models require a subscription for continued use.

Model details

| Model | Author | Model ID | Format | Size |
| --- | --- | --- | --- | --- |
| Mistral Instruct 7B Q4 | MistralAI, The Bloke | mistral-ins-7b-q4 | GGUF | 4.07GB |
| OpenHermes Neural 7B Q4 | Intel, Jan | openhermes-neural-7b | GGUF | 4.07GB |
| Stealth 7B Q4 | Jan | stealth-v1.2-7b | GGUF | 4.07GB |
| Trinity-v1.2 7B Q4 | Jan | trinity-v1.2-7b | GGUF | 4.07GB |
| Openchat-3.5 7B Q4 | Openchat | openchat-3.5-7b | GGUF | 4.07GB |
| Wizard Coder Python 13B Q5 | WizardLM, The Bloke | wizardcoder-13b | GGUF | 7.33GB |
| OpenAI GPT 3.5 Turbo | OpenAI | gpt-3.5-turbo | GGUF | - |
| OpenAI GPT 3.5 Turbo 16k 0613 | OpenAI | gpt-3.5-turbo-16k-0613 | GGUF | - |
| OpenAI GPT 4 | OpenAI | gpt-4 | GGUF | - |
| TinyLlama Chat 1.1B Q4 | TinyLlama | tinyllama-1.1b | GGUF | 638.01MB |
| Deepseek Coder 1.3B Q8 | Deepseek, The Bloke | deepseek-coder-1.3b | GGUF | 1.33GB |
| Phi-2 3B Q8 | Microsoft | phi-2-3b | GGUF | 2.76GB |
| Llama 2 Chat 7B Q4 | MetaAI, The Bloke | llama2-chat-7b-q4 | GGUF | 3.80GB |
| CodeNinja 7B Q4 | Beowolx | codeninja-1.0-7b | GGUF | 4.07GB |
| Noromaid 7B Q5 | NeverSleep | noromaid-7b | GGUF | 4.07GB |
| Starling alpha 7B Q4 | Berkeley-nest, The Bloke | starling-7b | GGUF | 4.07GB |
| Yarn Mistral 7B Q4 | NousResearch, The Bloke | yarn-mistral-7b | GGUF | 4.07GB |
| LlaVa 1.5 7B Q5 K | Mys | llava-1.5-7b-q5 | GGUF | 5.03GB |
| BakLlava 1 | Mys | bakllava-1 | GGUF | 5.36GB |
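
The Size column above can serve as a rough guide when choosing a model for a machine with limited disk space. The following is a minimal sketch, not part of Jan itself: the helper names `size_in_mb` and `models_within` are hypothetical, the dictionary holds only a handful of rows copied from the table, and the conversion assumes 1GB = 1024MB. File size is only a proxy; actual memory use at runtime may differ.

```python
# A few Model ID -> Size pairs copied from the "Model details" table.
# "-" means the table lists no size for that model.
MODEL_SIZES = {
    "mistral-ins-7b-q4": "4.07GB",
    "wizardcoder-13b": "7.33GB",
    "gpt-4": "-",
    "tinyllama-1.1b": "638.01MB",
    "deepseek-coder-1.3b": "1.33GB",
    "phi-2-3b": "2.76GB",
    "llama2-chat-7b-q4": "3.80GB",
    "llava-1.5-7b-q5": "5.03GB",
}

# Assumption: the table uses binary units (1GB = 1024MB).
UNITS_IN_MB = {"MB": 1, "GB": 1024}


def size_in_mb(size):
    """Convert a size string such as '4.07GB' to megabytes; None if no size is listed."""
    for unit, factor in UNITS_IN_MB.items():
        if size.endswith(unit):
            return float(size[: -len(unit)]) * factor
    return None


def models_within(budget_mb, sizes=MODEL_SIZES):
    """Return the model IDs whose listed size fits the budget, smallest first."""
    fitting = []
    for model_id, size in sizes.items():
        mb = size_in_mb(size)
        if mb is not None and mb <= budget_mb:
            fitting.append((mb, model_id))
    return [model_id for _, model_id in sorted(fitting)]


if __name__ == "__main__":
    # Models from the sample whose listed size is at most 3GB.
    print(models_within(3 * 1024))
```

With a 3GB budget, this sample selects `tinyllama-1.1b`, `deepseek-coder-1.3b`, and `phi-2-3b`. Models with no listed size are skipped because they cannot be compared.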