tusharg92 (Tushar Gupta)

upvoted 2 articles 3 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4

•

263

Article

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

Sep 10

•

108

upvoted 2 articles 6 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23, 2024

•

181

Article

Transformers backend integration in SGLang

Jun 23

•

54

upvoted 3 articles 10 months ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20

•

315

Article

1 Billion Classifications

Feb 13

•

45

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

•

1.31k

upvoted 3 papers about 1 year ago

upvoted a collection about 1 year ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 666

upvoted 2 articles about 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

•

272

Article

Accelerate 1.0.0

Sep 13, 2024

•

54

upvoted 2 papers over 1 year ago

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27, 2024 • 42

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12, 2024 • 138

upvoted an article over 1 year ago

Article

Welcome Gemma 2 - Google’s new open LLM

Jun 27, 2024

•

132

upvoted 2 collections over 1 year ago

Jina Reranker v2

Collection

A collection of state-of-the-art multilingual neural rerankers • 1 item • Updated Jul 20 • 9

Qwen2

Collection

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Jul 21 • 374
