Hand Curated Top AI Papers The Illusion of State in State-Space Models December 5, 2024 Large Language Model Enhanced Text-to-SQL Generation: A Survey November 24, 2024 The Surprising Effectiveness of Test-Time Training for Abstract Reasoning November 15, 2024November 19, 2024 Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely November 15, 2024 A Taxonomy of AgentOps for Enabling Observability of Foundation Model based Agents November 12, 2024 Attacking Vision-Language Computer Agents via Pop-ups November 6, 2024 Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models November 5, 2024 Towards Reasoning in Large Language Models: A Survey October 31, 2024 Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications October 31, 2024 GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models October 31, 2024 AI Tutoring Outperforms Active Learning October 29, 2024 Large Language Models Reflect the Ideology of their Creators October 29, 2024 A Survey on Data Synthesis and Augmentation for Large Language Models October 22, 2024 xLAM: A Family of Large Action Models to Empower AI Agent Systems October 20, 2024 Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning October 12, 2024 Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely October 9, 2024 SMALL LANGUAGE MODELS: SURVEY, MEASUREMENTS, AND INSIGHTS October 3, 2024 SMALL LANGUAGE MODELS: SURVEY, MEASUREMENTS, AND INSIGHTS October 3, 2024 Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely October 1, 2024 Segment Anything Model 2 (SAM 2) September 29, 2024 ColPali: Efficient Document Retrieval with Vision Language Models September 28, 2024 Chain-of-Thought Prompting Elicits Reasoning in Large Language Models September 28, 2024 Agents in Software Engineering: Survey, Landscape, and Vision September 27, 2024 SMALL LANGUAGE MODELS: SURVEY, MEASUREMENTS, AND INSIGHTS September 26, 2024 Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely September 25, 2024 Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely September 25, 2024 NVLM: Open Frontier-Class Multimodal LLMs September 24, 2024 The Rapid Adoption of Generative AI September 24, 2024 Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves September 22, 2024 MUTUAL REASONING MAKES SMALLER LLMS STRONGER PROBLEM-SOLVERS September 21, 2024 Towards a Unified View of Preference Learning for Large Language Models: A Survey September 21, 2024 Towards a Unified View of Preference Learning for Large Language Models: A Survey September 21, 2024 REAC T : SYNERGIZING REASONING AND ACTING IN LANGUAGE MODELS September 20, 2024 TO COT OR NOT TO COT? CHAIN-OF-THOUGHT HELPS MAINLY ON MATH AND SYMBOLIC REASONING September 20, 2024 Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters September 19, 2024 Trustworthiness in Retrieval-Augmented Generation Systems: A Survey September 18, 2024 LLMs Will Always Hallucinate, and We Need to Live With This September 16, 2024 LLMs Will Always Hallucinate, and We Need to Live With This September 16, 2024 Agents in Software Engineering: Survey, Landscape, and Vision September 16, 2024 Agents in Software Engineering: Survey, Landscape, and Vision September 16, 2024 Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents September 15, 2024 Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents September 15, 2024 Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking September 15, 2024 Large Language Model-Based Agents for Software Engineering: A Survey September 10, 2024 Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers September 10, 2024 LONGCITE: ENABLING LLMS TO GENERATE FINE-GRAINED CITATIONS IN LONG-CONTEXT QA September 8, 2024 Scaling Laws for Economic Productivity: Experimental Evidence in LLM-Assisted Translation September 6, 2024 Golden-Retriever: High-Fidelity Agentic Retrieval Augmented Generation for Industrial Knowledge Base September 6, 2024 The Effects of Generative AI on High Skilled Work: Evidence from Three Field Experiments with Software Developers September 5, 2024September 5, 2024 Query Rewriting for Retrieval-Augmented Large Language Models September 4, 2024 RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs September 4, 2024 Corrective Retrieval Augmented Generation September 4, 2024 SELF-RAG: LEARNING TO RETRIEVE, GENERATE, AND CRITIQUE THROUGH SELF-REFLECTION September 4, 2024 CRAG – Comprehensive RAG Benchmark September 4, 2024
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning November 15, 2024November 19, 2024
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely November 15, 2024
Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models November 5, 2024
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models October 31, 2024
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning October 12, 2024
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely October 9, 2024
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely October 1, 2024
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely September 25, 2024
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely September 25, 2024
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves September 22, 2024
TO COT OR NOT TO COT? CHAIN-OF-THOUGHT HELPS MAINLY ON MATH AND SYMBOLIC REASONING September 20, 2024
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters September 19, 2024
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers September 10, 2024
Scaling Laws for Economic Productivity: Experimental Evidence in LLM-Assisted Translation September 6, 2024
Golden-Retriever: High-Fidelity Agentic Retrieval Augmented Generation for Industrial Knowledge Base September 6, 2024
The Effects of Generative AI on High Skilled Work: Evidence from Three Field Experiments with Software Developers September 5, 2024September 5, 2024