🧠 Model

Qwen2.5-3B-Instruct

by Qwen

--- license: other license_name: qwen-research license_link: https://huggingface.co/Qwen/Qwen2.5-3B-Instruct/blob/main/LICENSE language: - en pipeline_tag: text-generation base_model: Qwen/Qwen2.5-3B tags: - chat library_name: transformers --- Qwen2.5 is the latest series of Qwen large language mode...

🕐 Updated 12/18/2025

🧠 Architecture Explorer

Neural network architecture

1 Input Layer

2 Hidden Layers

3 Attention

4 Output Layer

Learn about Transformers →

About

Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qwen2.5 brings the following improvements upon Qwen2: -...

📝 Limitations & Considerations

• Benchmark scores may vary based on evaluation methodology and hardware configuration.
• VRAM requirements are estimates; actual usage depends on quantization and batch size.
• FNI scores are relative rankings and may change as new models are added.
• Data source: [{"source_platform":"huggingface","source_url":"https://huggingface.co/Qwen/Qwen2.5-3B-Instruct","fetched_at":"2025-12-18T04:21:59.020Z","adapter_version":"3.2.0"}]

📚 Related Resources

📄 Related Papers

No related papers linked yet. Check the model's official documentation for research papers.

📊 Training Datasets

Training data information not available. Refer to the original model card for details.

🔗 Related Models

Data unavailable

Model Information Summary
Model Name
Author	Qwen
Type	text-generation
Downloads	8,951,498
Likes	350
Source	Unknown
Last Updated	December 18, 2025

Graph Overview

200 Models

460 Connections

Explore Full Graph →

🚀 What's Next?

📊