Nous: Hermes 2 Mistral 7B DPO

nousresearch/nous-hermes-2-mistral-7b-dpo

This is the flagship 7B Hermes model, a Direct Preference Optimization (DPO) of Teknium/OpenHermes-2.5-Mistral-7B. It shows improvement across the board on all benchmarks tested - AGIEval, BigBench Reasoning, GPT4All, and TruthfulQA.

The model prior to DPO was trained on 1,000,000 instructions/chats of GPT-4 quality or better, primarily synthetic data as well as other high quality datasets.

Model weights

Modalities

Context

Low

8K

Released

Feb 21, 2024

Knowledge Cutoff

Sep 2023

Activity

Token volume and request traffic to this model over time.