Wenyan-Qwen3-8B

An attempt to build a Xiaolong-like tune with more Gutenberg data on top of lemon07r/Qwen3-R1-SLERP-Q3T-8B.

Results

I haven't done much testing but the model will sometimes skip thinking. The second epoch may have overcooked it.

Condensed and formatted data available here.

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Finetuned

(2)

this model

Quantizations