Wenyan-Qwen3-8B

An attempt to build a Xiaolong-like tune with more Gutenberg data on top of lemon07r/Qwen3-R1-SLERP-Q3T-8B.

Results

I haven't done much testing, but the model will sometimes skip thinking. The second epoch may have overcooked it.
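To put a rough number on how often thinking gets skipped, a small helper along these lines could be run over a batch of generations. This is only a sketch: it assumes the model wraps its reasoning in `<think>...</think>` tags the way Qwen3 does, and the names `split_thinking` and `skipped_thinking` are made up for illustration.

```python
import re

# Assumption: reasoning is wrapped in <think>...</think>, as in Qwen3 output.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_thinking(text: str) -> tuple[str, str]:
    """Return (reasoning, answer); reasoning is "" if no think block has content."""
    match = THINK_BLOCK.search(text)
    if match is None:
        # No think block at all: treat the whole output as the answer.
        return "", text.strip()
    reasoning = match.group(1).strip()
    answer = text[match.end():].strip()
    return reasoning, answer


def skipped_thinking(text: str) -> bool:
    """True when the output has no think block or only an empty one."""
    reasoning, _ = split_thinking(text)
    return reasoning == ""
```

Counting `skipped_thinking(output)` over decoded generations (decoded without stripping special tokens, so the tags survive) gives a skip rate that can be compared between checkpoints, for example epoch one against epoch two.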

Data

Condensed and formatted data is available here.

Safetensors: 8B params, BF16