arXiv

arXiv is a prominent, open-access repository and preprint server that serves as a foundational platform for the global research community to disseminate scientific and technical findings. By hosting papers before they undergo formal peer review and publication in traditional journals, arXiv enables the rapid exchange of knowledge across diverse disciplines, with a particularly high concentration of activity in computer science, artificial intelligence, and machine learning.

The core identity of arXiv is defined by its stated commitment to openness and community and by its role in accelerating scientific progress. It provides a standardized infrastructure for researchers to document innovations, ranging from architectural improvements such as gated delta networks ('Gated delta networks: improving mamba2 with delta rule') to theoretical frameworks such as integrated information theory (cited by eLife). This accessibility lets scholars track developments as they appear in rapidly evolving fields such as large language models (LLMs), retrieval-augmented generation ('Evaluation of retrieval-augmented generation: A survey'), and knowledge graph integration ('Knowledge graph large language model (kg-llm) for link prediction').

A key characteristic of the platform is its identifier system, which assigns a unique, persistent code (e.g., arXiv:2502.08482) to every submission so that specific preprints can be tracked, cited, and referenced reliably. These identifiers let researchers ground their own work in the latest literature, whether they are conducting systematic literature reviews or addressing specific challenges such as hallucination mitigation ('Chain-of-Verification Reduces Hallucination').
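The identifier format described above can be illustrated with a short Python sketch. It assumes the identifier scheme arXiv has used since 2007 (two-digit year, two-digit month, a four- or five-digit sequence number, and an optional version suffix) and the https://arxiv.org/abs/ landing-page pattern. The names `ArxivId` and `find_arxiv_ids` are hypothetical helpers introduced for illustration, not part of any arXiv tooling.

```python
import re
from typing import List, NamedTuple, Optional

# Assumed pattern: "arXiv:" + YYMM + "." + 4 or 5 digits + optional "vN".
# Older, pre-2007 identifiers (e.g. "hep-th/9901001") are deliberately ignored.
ARXIV_ID = re.compile(r"arXiv:(\d{2})(\d{2})\.(\d{4,5})(?!\d)(v\d+)?")


class ArxivId(NamedTuple):
    year: int               # four-digit year, assuming the 2000s
    month: int              # 1..12
    number: str             # sequence number, kept as text to preserve leading zeros
    version: Optional[str]  # e.g. "v2", or None when no version is given

    @property
    def abs_url(self) -> str:
        """Landing-page URL for this identifier."""
        yymm = f"{self.year % 100:02d}{self.month:02d}"
        return f"https://arxiv.org/abs/{yymm}.{self.number}{self.version or ''}"


def find_arxiv_ids(text: str) -> List[ArxivId]:
    """Return every well-formed new-style arXiv identifier found in text."""
    found = []
    for match in ARXIV_ID.finditer(text):
        yy, mm, number, version = match.groups()
        month = int(mm)
        if not 1 <= month <= 12:
            continue  # skip strings that look like identifiers but have no valid month
        found.append(ArxivId(2000 + int(yy), month, number, version))
    return found


if __name__ == "__main__":
    sample = ("word representations (arXiv:1301.3781) and "
              "loop-aligned reasoning (arXiv:2502.08482)")
    for arxiv_id in find_arxiv_ids(sample):
        print(arxiv_id, arxiv_id.abs_url)
```

Keeping the sequence number as a string is a deliberate choice: converting `08482` to an integer would drop the leading zero and produce a different, invalid identifier.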

The significance of arXiv extends beyond hosting: it acts as a primary venue for influential research papers and a critical resource for academic surveying. By bridging the gap between initial discovery and formal publication, arXiv lets specialized domains such as medical AI ('Capabilities of gemini models in medicine'), cybersecurity, and autonomous systems (Trends Research & Advisory) evolve through continuous public scrutiny and iterative feedback.

Model Perspectives (4)
openrouter/google/gemini-3.1-flash-lite-preview definitive 100% confidence
arXiv is a prominent repository used by the research community for the publication of preprints, serving as a primary source for disseminating scientific and technical advancements. It functions as a critical platform for researchers to share findings across diverse fields such as computer science, artificial intelligence, and machine learning. Its significance is evidenced by its role as a key venue for influential research papers and its frequent citation in comprehensive academic surveys. Researchers utilize arXiv to document a wide array of innovations, including advancements in Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and knowledge graph integration. The platform is also instrumental for systematic literature reviews, enabling scholars to track developments in specialized domains like medical AI, cybersecurity, and neural-symbolic learning. Furthermore, arXiv provides a standardized identifier system (e.g., arXiv:2502.08482), which facilitates the reliable tracking and referencing of specific preprints.
openrouter/google/gemini-3.1-flash-lite-preview definitive 100% confidence
arXiv serves as a prominent repository for scholarly research, particularly in the fields of artificial intelligence and large language models (LLMs). It functions as a platform for authors to publish preprints, enabling the dissemination of research findings such as surveys ('A Survey on the Theory and Mechanism of Large Language Models'), technical architectural improvements ('Gated delta networks: improving mamba2 with delta rule'), and applications in specialized domains like medicine ('Capabilities of gemini models in medicine'). Beyond simple hosting, arXiv acts as a foundational resource for academic surveying, with numerous papers citing arXiv preprints to ground their discussions on topics such as model evaluation ('Evaluating large language models: a comprehensive survey'), hallucination mitigation ('Chain-of-Verification Reduces Hallucination'), and retrieval-augmented generation (RAG) ('Evaluation of retrieval-augmented generation: A survey'). The platform is also central to research on knowledge graphs and their integration with LLMs ('Knowledge graph large language model (kg-llm) for link prediction'; 'KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment'). According to arXiv, the organization is guided by core values including openness, community, excellence, and the protection of user data privacy.
openrouter/google/gemini-3.1-flash-lite-preview 100% confidence
arXiv functions as a primary repository for preprints in the fields of artificial intelligence and machine learning. It serves as a critical distribution platform for foundational research, including seminal work such as Mikolov's 2013 paper on word representations ('Efficient estimation of word representations') and the 2017 paper on Proximal Policy Optimization ('Proximal policy optimization algorithms'). The repository is frequently referenced in comprehensive academic surveys, such as 'A Survey on the Theory and Mechanism of Large Language Models,' which uses arXiv preprints to ground its analysis of LLM watermarking ('Theoretically grounded framework for llm watermarking'), scalable oversight ('Improving weak-to-strong generalization'), reinforcement learning ('Spurious rewards: rethinking training signals'), and transformer architecture ('Reasoning with latent thoughts'). Additionally, arXiv hosts diverse contemporary research covering topics ranging from neuro-symbolic AI ('A Study on Neuro-Symbolic Artificial Intelligence') and agentic memory systems ('A-MEM, an agentic memory') to benchmarks for multimodal generation ('MRAMG-Bench: A BeyondText Benchmark').
openrouter/x-ai/grok-4.1-fast 100% confidence
arXiv serves as a central preprint repository and server for academic papers, particularly in AI, machine learning, and related fields, hosting numerous preprints before formal publication. It is described as the 'arXiv preprint server' hosting papers such as 'Generalized Measures of Information Transfer' and 'Integrated information theory (IIT) 4.0' from 2022 (both cited by eLife). Sources such as eLife, Trends Research & Advisory, and arXiv itself attribute preprints to it, including 'Agentic Artificial Intelligence and Autonomous Cyber Operations' in 2025 (Trends Research & Advisory) and specific identifiers such as arXiv:2505.24313 for weak-to-strong generalization. arXiv connects to diverse research areas: it is cited in surveys on LLMs for works on watermarking (arXiv:2410.02890), scalable oversight (arXiv:2402.00667), and neuro-symbolic architectures (arXiv:2205.00445, by Ehud Karpas et al.), as well as earlier works such as word representations (arXiv:1301.3781, by Tomas Mikolov; sources: arXiv, Springer). Publications from GitHub, Heriot-Watt University, Zylos, and others further link arXiv to topics like multimodal benchmarks and hallucination detection.

Facts (140)

Sources
A Survey on the Theory and Mechanism of Large Language Models arxiv.org arXiv Mar 12, 2026 50 facts
reference: The paper 'Enhancing auto-regressive chain-of-thought through loop-aligned reasoning' is an arXiv preprint with identifier arXiv:2502.08482.
reference: The research paper 'On llms-driven synthetic data generation, curation, and evaluation: a survey' was published as an arXiv preprint (arXiv:2505.10559) and cited in section 3.2.2 of the survey.
reference: The paper 'Rnns are not transformers (yet): the key bottleneck on in-context retrieval' is an arXiv preprint (arXiv:2402.18510).
reference: The paper 'Deepseek-r1: incentivizing reasoning capability in llms via reinforcement learning' (arXiv:2501.12948) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding reasoning capabilities.
reference: The paper 'From low intrinsic dimensionality to non-vacuous generalization bounds in deep multi-task learning' is an arXiv preprint (arXiv:2501.19067) cited in section 2.2.1 of 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Vision superalignment: weak-to-strong generalization for vision foundation models' (arXiv:2402.03749) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding alignment.
reference: The paper 'Demystify mamba in vision: a linear attention perspective' (arXiv:2405.16605) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding linear attention.
reference: The paper 'Looped transformers are better at learning learning algorithms' is an arXiv preprint (arXiv:2311.12424) that investigates the capability of looped transformers to learn algorithms.
reference: The paper 'A mathematical exploration of why language models help solve downstream tasks' is an arXiv preprint (arXiv:2010.03648) cited in 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Transformers, parallel computation, and logarithmic depth' is an arXiv preprint (arXiv:2402.09268) cited in section 3.2.1 of 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Ampo: automatic multi-branched prompt optimization' is an arXiv preprint (arXiv:2410.08696) that introduces a method for automatic multi-branched prompt optimization.
reference: The paper 'Connecting large language models with evolutionary algorithms yields powerful prompt optimizers' (arXiv:2309.08532) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding prompt optimization.
reference: The paper 'Jailbreak attacks and defenses against large language models: a survey' is an arXiv preprint with identifier arXiv:2407.04295.
reference: The paper 'Trustllm: trustworthiness in large language models' is an arXiv preprint, identified as arXiv:2401.05561.
reference: The paper 'Can you trust llm judgments? reliability of llm-as-a-judge' is an arXiv preprint (arXiv:2412.12509) cited in 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Benchmark data contamination of large language models: a survey' is an arXiv preprint (arXiv:2406.04244).
reference: The paper 'Large language model alignment: a survey' is an arXiv preprint (arXiv:2309.15025) cited in 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Data mixing laws: optimizing data mixtures by predicting language modeling performance' is an arXiv preprint with identifier arXiv:2403.16952.
reference: The paper 'Contranorm: a contrastive learning perspective on oversmoothing and beyond' (arXiv:2303.06562) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding contrastive learning.
reference: The research paper 'Neural thermodynamic laws for large language model training' was published as an arXiv preprint (arXiv:2402.15505) and cited in section 5.2.1 of the survey.
reference: The paper 'The prompt report: a systematic survey of prompt engineering techniques' is an arXiv preprint (arXiv:2406.06608) cited in 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'DeepSeekMath: pushing the limits of mathematical reasoning in open language models' is an arXiv preprint (arXiv:2402.03300) cited in 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Gated delta networks: improving mamba2 with delta rule' is an arXiv preprint (arXiv:2412.06464) that proposes improvements to the Mamba2 architecture using the delta rule.
reference: The paper 'An explanation of in-context learning as implicit bayesian inference' is an arXiv preprint (arXiv:2111.02080).
reference: The paper 'STanhop: sparse tandem hopfield model for memory-enhanced time series prediction' is an arXiv preprint (arXiv:2312.17346).
reference: The paper 'Rest-mcts*: llm self-training via process reward guided tree search' is an arXiv preprint (arXiv:2406.03816) cited in section 6.2.3 of 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'A taxonomy for data contamination in large language models' is an arXiv preprint, identified as arXiv:2407.08716.
reference: The paper 'Benefits of transformer: in-context learning in linear regression tasks with unstructured data' is an arXiv preprint (arXiv:2402.00743).
reference: The research paper 'On robustness and reliability of benchmark-based evaluation of llms' was published as an arXiv preprint (arXiv:2509.04013).
reference: The paper 'Language models represent space and time' (arXiv:2310.02207) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding representation.
reference: The paper 'Large language model safety: a holistic survey' is an arXiv preprint (arXiv:2412.17686) cited in 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Are transformers universal approximators of sequence-to-sequence functions?' is an arXiv preprint with identifier arXiv:1912.10077.
reference: The paper 'Autoprompt: eliciting knowledge from language models with automatically generated prompts' is an arXiv preprint (arXiv:2010.15980) cited in 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Compression represents intelligence linearly' is an arXiv preprint, identified as arXiv:2404.09937.
reference: The paper 'The mosaic memory of large language models' is an arXiv preprint (arXiv:2405.15523) cited in 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Evaluating large language models: a comprehensive survey' (arXiv:2310.19736) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding LLM evaluation.
reference: The paper 'On protecting the data privacy of large language models (llms): a survey' is an arXiv preprint (arXiv:2403.05156) that reviews data privacy concerns regarding large language models.
reference: The paper 'How close is chatgpt to human experts? comparison corpus, evaluation, and detection' (arXiv:2301.07597) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding LLM evaluation.
reference: The paper 'Training large language models to reason in a continuous latent space' (arXiv:2412.06769) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding reasoning.
reference: The paper 'Gated linear attention transformers with hardware-efficient training' is an arXiv preprint (arXiv:2312.06635) that discusses gated linear attention transformers and their training efficiency.
reference: The paper 'Knowledge-infused prompting: assessing and advancing clinical text data generation with large language models' is an arXiv preprint (arXiv:2311.00287) that explores the intersection of large language models and clinical data generation.
reference: The paper 'Emergence of segmentation with minimalistic white-box transformers' is an arXiv preprint with identifier arXiv:2308.16271.
reference: The paper 'On the emergence of weak-to-strong generalization: a bias-variance perspective' is an arXiv preprint (arXiv:2505.24313).
reference: The paper 'Theoretically grounded framework for llm watermarking: a distribution-adaptive approach' (arXiv:2410.02890) is cited in the survey 'A Survey on the Theory and Mechanism of Large Language Models' regarding watermarking.
reference: The paper 'Improving weak-to-strong generalization with scalable oversight and ensemble learning' is an arXiv preprint (arXiv:2402.00667) cited in section 5.2.1 of 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Self-attention networks can process bounded hierarchical languages' is an arXiv preprint (arXiv:2105.11115) that demonstrates the capability of self-attention networks to process bounded hierarchical languages.
reference: The paper 'Entropy-memorization law: evaluating memorization difficulty of data in llms' is an arXiv preprint, identified as arXiv:2507.06056.
reference: The paper 'Proximal policy optimization algorithms' is an arXiv preprint (arXiv:1707.06347) cited in 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Reasoning with latent thoughts: on the power of looped transformers' is an arXiv preprint (arXiv:2502.17416) cited in 'A Survey on the Theory and Mechanism of Large Language Models'.
reference: The paper 'Spurious rewards: rethinking training signals in rlvr' is an arXiv preprint (arXiv:2506.10947) cited in 'A Survey on the Theory and Mechanism of Large Language Models'.
LLM-empowered knowledge graph construction: A survey - arXiv arxiv.org arXiv Oct 23, 2025 16 facts
reference: Tianshu Wang, Xiaoyang Chen, Hongyu Lin, Xuanang Chen, Xianpei Han, Hao Wang, Zhenyu Zeng, and Le Sun investigated the use of large language models for entity matching in their 2024 arXiv preprint.
reference: Rui Yang, Boming Yang, Sixun Ouyang, Tianwei She, Aosong Feng, Yuang Jiang, Freddy Lecue, Jinghui Lu, and Irene Li developed Graphusion, a method leveraging large language models for scientific knowledge graph fusion and construction in NLP education, as described in their 2024 arXiv preprint.
reference: Yash Tiwari, Owais Ahmad Lone, and Mayukha Pal proposed OntoRAG, a system that enhances question-answering by automating ontology derivation from unstructured knowledge bases, as detailed in their 2025 arXiv preprint.
reference: Yejin Kim, Eojin Kang, Juae Kim, and H. Howie Huang authored 'Causal Reasoning in Large Language Models: A Knowledge Graph Approach', published as an arXiv preprint in October 2024.
reference: Anna Sofia Lippolis, Mohammad Javad Saeedizade, Robin Keskisärkkä, Sara Zuppiroli, Miguel Ceriani, Aldo Gangemi, Eva Blomqvist, and Andrea Giovanni Nuzzolese authored the paper 'Ontology Generation using Large Language Models,' which was published as an arXiv preprint in March 2025.
reference: Preston Rasmussen, Pavlo Paliychuk, Travis Beauvais, Jack Ryan, and Daniel Chalef published 'Zep: A Temporal Knowledge Graph Architecture for Agent Memory' as an arXiv preprint in January 2025.
reference: Patricia Mateiu and Adrian Groza authored the paper 'Ontology engineering with Large Language Models,' which was published as an arXiv preprint in July 2023.
reference: Jiaqi Sun, Shiyou Qian, Zhangchi Han, Wei Li, Zelin Qian, Dingyu Yang, Jian Cao, and Guangtao Xue developed LKD-KGC, a method for domain-specific knowledge graph construction using LLM-driven knowledge dependency parsing, as described in their 2025 arXiv preprint.
reference: Gerard Pons, Besim Bilalli, and Anna Queralt published 'Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation' as an arXiv preprint in 2025.
reference: Yuxing Lu and Jinzhuo Wang authored the paper 'KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment,' which was published as an arXiv preprint in February 2025.
reference: Samira Khorshidi, Azadeh Nikfarjam, Suprita Shankar, Yisi Sang, Yash Govind, Hyun Jang, Ali Kasgari, Alexis McClimans, Mohamed Soliman, Vishnu Konda, Ahmed Fakhry, and Xiaoguang Qi authored 'ODKE+: Ontology-Guided Open-Domain Knowledge Extraction with LLMs', published as an arXiv preprint in September 2025.
reference: Ye et al. (2023) authored 'Schema-adaptable Knowledge Graph Construction', published as an arXiv preprint (arXiv:2305.08703) in November 2023.
reference: Junming Liu, Siyuan Meng, Yanting Gao, Song Mao, Pinlong Cai, Guohang Yan, Yirong Chen, Zilin Bian, Ding Wang, and Botian Shi authored the paper 'Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning,' which was published as an arXiv preprint in July 2025.
reference: Xiang Wei, Xingyu Cui, Ning Cheng, Xiaobin Wang, Xin Zhang, Shen Huang, Pengjun Xie, Jinan Xu, Yufeng Chen, Meishan Zhang, Yong Jiang, and Wenjuan Han introduced ChatIE, a method for zero-shot information extraction via chatting with ChatGPT, in their 2024 arXiv preprint.
reference: Wenjie Wu, Yongcheng Jing, Yingjie Wang, Wenbin Hu, and Dacheng Tao developed Graph-Augmented Reasoning, a method for evolving step-by-step knowledge graph retrieval for LLM reasoning, as presented in their 2025 arXiv preprint.
reference: Wujiang Xu, Zujie Liang, Kai Mei, Hang Gao, Juntao Tan, and Yongfeng Zhang proposed A-MEM, an agentic memory system for LLM agents, in their 2025 arXiv preprint.
LLM-KG4QA: Large Language Models and Knowledge Graphs for ... github.com GitHub 16 facts
reference: The paper 'Benchmarking Large Language Models in Complex Question Answering Attribution using Knowledge Graphs' was published on arXiv in 2024, utilizes the CAQA dataset, and is categorized under KBQA and KGQA.
reference: The paper 'Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs' (arXiv, 2024) discusses incorporating knowledge graphs to enhance the domain expertise of Large Language Models.
reference: The paper 'mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs' (arXiv, 2025) introduces a modular benchmark for evaluating retrieval-augmented generation across text, tables, and knowledge graphs.
reference: The paper 'ER-RAG: Enhance RAG with ER-Based Unified Modeling of Heterogeneous Data Sources' was published on arXiv in 2025 and focuses on RDB QA.
reference: The paper 'Ontology-Aware RAG for Improved Question-Answering in Cybersecurity Education' (arXiv, 2024) explores the use of ontology-aware retrieval-augmented generation for cybersecurity education.
reference: The paper 'CR-LT-KGQA: A Knowledge Graph Question Answering Dataset Requiring Commonsense Reasoning and Long-Tail Knowledge' was published on arXiv in 2024, utilizes the CR-LT-KGQA dataset, and is categorized under KBQA and KGQA.
reference: The paper 'GTR: Graph-Table-RAG for Cross-Table Question Answering' was published on arXiv in 2025 and focuses on RDB QA.
reference: EICopilot is a system designed to search and explore enterprise information over large-scale knowledge graphs using Large Language Model-driven agents (arXiv, 2025).
reference: The paper 'A Prompt Engineering Approach and a Knowledge Graph based Framework for Tackling Legal Implications of Large Language Model Answers' (arXiv, 2024) proposes a framework combining prompt engineering and knowledge graphs to address legal implications in Large Language Model outputs.
reference: The paper 'WebFAQ: A Multilingual Collection of Natural Q&A Datasets for Dense Retrieval', published on arXiv in 2025, introduces the WebFAQ collection for multi-domain multilingual question answering.
claim: The preprint of the survey 'LLM-KG4QA: Large Language Models and Knowledge Graphs for QA' was made available on arXiv in May 2025.
reference: The paper 'MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge' was published on arXiv in 2024, utilizes the MINTQA dataset, and is categorized under Multi-hop QA.
reference: The paper 'KAG: Boosting LLMs in Professional Domains via Knowledge Augmented Generation' (arXiv, 2024) explores the use of Knowledge Augmented Generation to improve Large Language Models in professional domains.
reference: The paper 'MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation' (arXiv, 2025) by LiHua-World proposes a simplified approach to retrieval-augmented generation.
reference: The paper 'An Empirical Study over Open-ended Question Answering' (arXiv, 2024) investigates the OKGQA framework for Large Language Models and Knowledge Graphs in question answering.
reference: The paper 'MRAMG-Bench: A BeyondText Benchmark for Multimodal Retrieval-Augmented Multimodal Generation', published on arXiv in 2025, introduces the MRAMG benchmark for multi-modal question answering.
KG-IRAG: A Knowledge Graph-Based Iterative Retrieval-Augmented ... arxiv.org arXiv Mar 18, 2025 13 facts
reference: Junde Wu, Jiayuan Zhu, and Yunli Qi authored the paper 'Medical graph rag: Towards safe medical large language model via graph retrieval-augmented generation', published as arXiv preprint arXiv:2408.04187 in 2024.
reference: Yuzhe Zhang, Yipeng Zhang, Yidong Gan, Lina Yao, and Chen Wang authored the paper 'Causal graph discovery with retrieval-augmented generation based large language models', published as arXiv preprint arXiv:2402.15301 in 2024.
reference: Qingyu Tan, Hwee Tou Ng, and Lidong Bing authored the paper 'Towards benchmarking and improving the temporal reasoning capability of large language models', published as arXiv preprint arXiv:2306.08952 in 2023.
reference: Yuwei Xia, Ding Wang, Qiang Liu, Liang Wang, Shu Wu, and Xiaoyu Zhang authored the paper 'Enhancing temporal knowledge graph forecasting with large language models via chain-of-history reasoning', published as arXiv preprint arXiv:2402.14382 in 2024.
reference: Fei Wang, Xingchen Wan, Ruoxi Sun, Jiefeng Chen, and Sercan Ö Arık authored the paper 'Astute rag: Overcoming imperfect retrieval augmentation and knowledge conflicts for large language models', published as arXiv preprint arXiv:2410.07176 in 2024.
reference: Tianjun Zhang, Shishir G Patil, Naman Jain, Sheng Shen, Matei Zaharia, Ion Stoica, and Joseph E Gonzalez authored the paper 'Raft: Adapting language model to domain specific rag', published as arXiv preprint arXiv:2403.10131 in 2024.
reference: The research paper 'Factify5wqa: Fact verification through 5w question-answering' is available as arXiv preprint arXiv:2410.04236.
reference: Siheng Xiong, Ali Payani, Ramana Kompella, and Faramarz Fekri authored the paper 'Large language models can learn temporal reasoning', published as arXiv preprint arXiv:2401.06853 in 2024.
reference: Hugo Touvron et al. authored the paper 'Llama: Open and efficient foundation language models', published as arXiv preprint arXiv:2302.13971 in 2023.
reference: Diego Sanmartin authored 'Kg-rag: Bridging the gap between knowledge and creativity', published as an arXiv preprint (arXiv:2405.12035).
reference: Hao Yu, Aoran Gan, Kai Zhang, Shiwei Tong, Qi Liu, and Zhaofeng Liu authored the paper 'Evaluation of retrieval-augmented generation: A survey', published as arXiv preprint arXiv:2405.07437 in 2024.
reference: Dong Shu, Tianle Chen, Mingyu Jin, Yiting Zhang, Mengnan Du, and Yongfeng Zhang authored 'Knowledge graph large language model (kg-llm) for link prediction', published as an arXiv preprint (arXiv:2403.07311).
reference: Penghao Zhao, Hailin Zhang, Qinhan Yu, Zhengren Wang, Yunteng Geng, Fangcheng Fu, Ling Yang, Wentao Zhang, and Bin Cui authored the paper 'Retrieval-augmented generation for ai-generated content: A survey', published as arXiv preprint arXiv:2402.19473 in 2024.
Neuro-Symbolic AI: Explainability, Challenges, and Future Trends arxiv.org arXiv Nov 7, 2024 11 facts
reference: Hang Jiang, Sairam Gurajada, Qiuhao Lu, Sumit Neelam, Lucian Popa, Prithviraj Sen, Yunyao Li, and Alexander Gray authored 'LNN-EL: A neuro-symbolic approach to short-text entity linking', published as an arXiv preprint (arXiv:2106.09795) in 2021.
reference: Marconato et al. (2023) published research on neuro-symbolic continual learning, focusing on knowledge, reasoning shortcuts, and concept rehearsal, as an arXiv preprint.
reference: Arabshahi et al. (2018) proposed a method for combining symbolic expressions and black-box function evaluations in neural programs, published as an arXiv preprint.
reference: Bowen Jiang, Yangxinyu Xie, Xiaomeng Wang, Weijie J Su, Camillo J Taylor, and Tanwi Mallick authored the survey 'Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey', published as an arXiv preprint (arXiv:2406.00252) in 2024.
reference: Majumdar et al. (2023) presented Symbolic Regression for PDEs using Pruned Differentiable Programs, published as an arXiv preprint.
reference: Fadi Al Machot (2023) introduced ASPER, a neural-symbolic approach for enhanced reasoning in neural models, published as an arXiv preprint.
reference: Marconato et al. (2024) introduced BEARS, a method to make neuro-symbolic models aware of their reasoning shortcuts, published as an arXiv preprint.
reference: Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Salim Roukos, Alexander Gray, Ramon Astudillo, Maria Chang, Cristina Cornelio, Saswati Dana, Achille Fokoue, et al. authored 'Leveraging abstract meaning representation for knowledge base question answering', published as an arXiv preprint (arXiv:2012.01707) in 2020.
reference: Ma et al. (2019) proposed a framework for generalizable neuro-symbolic systems for commonsense question answering, published as an arXiv preprint.
reference: Ehud Karpas, Omri Abend, Yonatan Belinkov, Barak Lenz, Opher Lieber, Nir Ratner, Yoav Shoham, Hofit Bata, Yoav Levine, Kevin Leyton-Brown, et al. authored 'MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning', published as an arXiv preprint (arXiv:2205.00445) in 2022.
reference: Mao et al. (2019) introduced the neuro-symbolic concept learner, which interprets scenes, words, and sentences from natural supervision, published as an arXiv preprint.
Bridging the Gap Between LLMs and Evolving Medical Knowledge arxiv.org arXiv Jun 29, 2025 8 facts
reference: Peng Xia et al. (2024) published 'Mmed-rag: Versatile multimodal rag system for medical vision language models' as an arXiv preprint (arXiv:2410.13085), detailing a multimodal RAG system for medical applications.
reference: Shakhadri et al. (2024) published 'Shakti: A 2.5 billion parameter small language model optimized for edge ai and low-resource environments' as an arXiv preprint (arXiv:2410.11331).
reference: Sanmartin (2024) published 'Kg-rag: Bridging the gap between knowledge and creativity' as an arXiv preprint (arXiv:2405.12035).
reference: Harsh Trivedi et al. (2022) published 'Interleaving retrieval with chain-of-thought reasoning for knowledge-intensive multi-step questions' as an arXiv preprint (arXiv:2212.10509), which discusses combining retrieval with reasoning.
reference: Xuejiao Zhao et al. (2025) published 'Medrag: Enhancing retrieval-augmented generation with knowledge graph-elicited reasoning for healthcare copilot' as an arXiv preprint (arXiv:2502.04413), which focuses on improving RAG with knowledge graphs.
reference: Rui Yang et al. (2024) published 'Kg-rank: Enhancing large language models for medical qa with knowledge graphs and ranking techniques' as an arXiv preprint (arXiv:2403.05881), which proposes using knowledge graphs and ranking to improve medical QA.
reference: Saab et al. (2024) published 'Capabilities of gemini models in medicine' as an arXiv preprint (arXiv:2404.18416).
reference: Hongjian Zhou et al. (2023) published 'A survey of large language models in medicine: Progress, application, and challenge' as an arXiv preprint (arXiv:2311.05112).
LLM Hallucination Detection and Mitigation: State of the Art in 2026 zylos.ai Zylos Jan 27, 2026 7 facts
reference: The paper 'Fact-Checking with LLMs via Probabilistic Certainty and Consistency,' published on arXiv, proposes a method for verifying facts using probabilistic certainty and consistency metrics.
reference: The paper 'MiniCheck: Efficient Fact-Checking,' published on arXiv, introduces MiniCheck as an efficient method for verifying facts in AI-generated text.
reference: The paper 'Chain-of-Verification Reduces Hallucination,' published on arXiv, presents the Chain-of-Verification method as a technique for reducing hallucinations in large language models.
reference: The paper 'Integrative Decoding: Improve Factuality via Self-consistency,' published on arXiv, details an approach to improving the factuality of large language model outputs through self-consistency mechanisms.
reference: The paper 'RAGAS: Automated Evaluation of RAG,' published on arXiv, introduces RAGAS as a framework for the automated evaluation of retrieval-augmented generation systems.
reference: The paper 'A comprehensive taxonomy of hallucinations in LLMs,' published on arXiv, provides a structured classification system for different types of hallucinations in large language models.
reference: The paper 'Predictive Coding and Information Bottleneck for Hallucination Detection,' published on arXiv, explores using predictive coding and information bottleneck principles to detect hallucinations in large language models.
A Synergistic Workspace for Human Consciousness Revealed by ... elifesciences.org eLife 2 facts
referenceThe paper titled 'Integrated information theory (IIT) 4.0: Formulating the properties of phenomenal existence in physical terms' was published as a preprint on arXiv in 2022.
referenceThe research paper titled 'Generalized Measures of Information Transfer' was published on the arXiv preprint server.
Unknown source 2 facts
referenceThe research paper titled 'Knowledge Graph-extended Retrieval Augmented Generation for Question Answering' is published on arXiv.
referenceThe research paper titled 'KG-RAG: Bridging the Gap Between Knowledge and Creativity' is published on arXiv.
A Survey of Incorporating Psychological Theories in LLMs - arXiv arxiv.org arXiv 2 facts
referenceWenchao Dong, Assem Zhunis, Dongyoung Jeong, Hyojin Chin, Jiyoung Han, and Meeyoung Cha authored 'Persona setting pitfall: Persistent outgroup biases in large language models arising from social identity adoption', published as an arXiv preprint in 2024.
measurementThe authors of 'A Survey of Incorporating Psychological Theories in LLMs' surveyed 175 papers from major computational linguistics venues (ACL Anthology, COLING), NeurIPS, ICML, ICLR, and influential arXiv preprints published between late 2021 and early 2025.
Unlocking the Potential of Generative AI through Neuro-Symbolic ... arxiv.org arXiv Feb 16, 2025 2 facts
referenceGary Marcus published the preprint 'Deep learning: A critical appraisal' on arXiv in 2018.
referenceTomas Mikolov et al. authored 'Efficient estimation of word representations in vector space,' published as an arXiv preprint (arXiv:1301.3781) in 2013.
A Mixed-Methods Study of Open-Source Software Maintainers On ... arxiv.org arXiv Feb 3, 2025 1 fact
referenceJens Dietrich, Shawn Rasheed, Alexander Jordan, and Tim White published 'On the security blind spots of software composition analysis' as an arXiv preprint in 2023.
Beyond Missile Deterrence: The Rise of Algorithmic Superiority trendsresearch.org Trends Research & Advisory Mar 16, 2026 1 fact
referenceThe paper 'Agentic Artificial Intelligence and Autonomous Cyber Operations' was published on the arXiv repository in 2025.
Combining Knowledge Graphs With LLMs | Complete Guide - Atlan atlan.com Atlan Jan 28, 2026 1 fact
measurementResearch published on arXiv that analyzed 28 integration methods found that hybrid approaches combining multiple patterns achieved the best results for complex enterprise use cases.
A survey on augmenting knowledge graphs (KGs) with large ... link.springer.com Springer Nov 4, 2024 1 fact
referenceZhao WX, Zhou K, Li J, Tang T, Wang X, Hou Y, Min Y, Zhang B, Zhang J, Dong Z et al. published 'A survey of large language models' as an arXiv preprint (arXiv:2303.18223) in 2023.
Combining Knowledge Graphs and Large Language Models - arXiv arxiv.org arXiv Jul 9, 2024 1 fact
procedureThe authors of 'Combining Knowledge Graphs and Large Language Models' conducted a review of literature published between 2019 and 2024, searching arXiv from February 2024 to May 2024 for articles related to LLMs and KGs.
Leveraging Knowledge Graphs and LLM Reasoning to Identify ... arxiv.org arXiv Jul 23, 2025 1 fact
referenceThe paper 'A survey of large language models' by Wayne Xin Zhao et al. was published as an arXiv preprint in 2023.
Knowledge Graphs vs RAG: When to Use Each for AI in 2026 - Atlan atlan.com Atlan Feb 12, 2026 1 fact
claimResearch published on arXiv reports that KG²RAG (Knowledge Graph-Guided Retrieval Augmented Generation) frameworks, which use knowledge graphs to provide fact-level relationships between chunks, improve both response quality and retrieval quality compared to existing RAG approaches.
https://scholar.google.com/citations?view_op=view_... scholar.google.com Md Kamruzzaman Sarker, Lu Zhou, Aaron Eberhart, Pascal Hitzler · SAGE Publications 1 fact
referenceA preprint version of the article 'Neuro-Symbolic Artificial Intelligence' by Md Kamruzzaman Sarker, Lu Zhou, Aaron Eberhart, and Pascal Hitzler was published on arXiv in 2021 under the identifier arXiv:2105.05330.
KG-IRAG with Iterative Knowledge Retrieval - arXiv arxiv.org arXiv Mar 18, 2025 1 fact
claimarXiv is committed to the values of openness, community, excellence, and user data privacy.
Knowledge Graph Combined with Retrieval-Augmented Generation ... drpress.org Academic Journal of Science and Technology Dec 2, 2025 1 fact
referenceHe et al. introduced G-Retriever, a retrieval-augmented generation framework for textual graph understanding and question answering, in an arXiv preprint in 2024.
Neural-Symbolic AI: The Next Breakthrough in Reliable and ... hu.ac.ae Heriot-Watt University Dec 29, 2025 1 fact
referenceThe paper 'A Study on Neuro-Symbolic Artificial Intelligence: Healthcare Perspectives' by D. Hossain and J. Y. Chen was published as an arXiv preprint (arXiv:2503.18213) in 2025.