view article Article Breaking Language Barriers in Mathematical AI: Introducing Hebrew Math Tutor Sep 7 • 3
view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques Mar 24 • 20
view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques Mar 24 • 20
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models Paper • 2502.09390 • Published Feb 13 • 16
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model +6 Oct 29, 2024 • 59
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation Paper • 2408.02545 • Published Aug 5, 2024 • 39
Accelerating Speculative Decoding using Dynamic Speculation Length Paper • 2405.04304 • Published May 7, 2024 • 2
Distributed Speculative Inference of Large Language Models Paper • 2405.14105 • Published May 23, 2024 • 18
Distributed Speculative Inference of Large Language Models Paper • 2405.14105 • Published May 23, 2024 • 18
ABSApp: A Portable Weakly-Supervised Aspect-Based Sentiment Extraction System Paper • 1909.05608 • Published Sep 12, 2019
Term Set Expansion based NLP Architect by Intel AI Lab Paper • 1808.08953 • Published Aug 27, 2018 • 1