concept

ArXiv

Facts (51)

Sources

Unlocking the Potential of Generative AI through Neuro-Symbolic ... arxiv.org arXiv Feb 16, 2025 16 facts

referenceIshaan Singh, Navdeep Kaur, Garima Gaur, et al. authored 'Neustip: A novel neuro-symbolic model for link and time prediction in temporal knowledge graphs', published as an arXiv preprint (arXiv:2305.11301) in 2023.

referenceJunlin Xie, Zhihong Chen, Ruifei Zhang, Xiang Wan, and Guanbin Li published 'Large multimodal agents: A survey' as an arXiv preprint (arXiv:2402.15116) in 2024.

referenceKunlong Chen et al. developed a question-directed graph attention network for numerical reasoning over text, published as an arXiv preprint in 2020.

referenceAlexander I Cowen-Rivers, Pasquale Minervini, Tim Rocktaschel, Matko Bosnjak, Sebastian Riedel, and Jun Wang authored the paper 'Neural variational inference for estimating uncertainty in knowledge graph embeddings', published as an arXiv preprint in 2019.

referenceThe paper 'Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning' was published as an arXiv preprint in 2025.

referenceDmitry Lepikhin, HyoukJoong Lee, Yuanzhong Xu, Dehao Chen, Orhan Firat, Yanping Huang, Maxim Krikun, Noam Shazeer, and Zhifeng Chen published 'Gshard: Scaling giant models with conditional computation and automatic sharding' as an arXiv preprint (arXiv:2006.16668) in 2020.

referenceAlbert Q Jiang, Alexandre Sablayrolles, Antoine Roux, Arthur Mensch, Blanche Savary, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Emma Bou Hanna, Florian Bressand, et al. authored the paper 'Mixtral of experts', which was published as an arXiv preprint in 2024.

referenceZhixuan Liu, Zihao Wang, Yuan Lin, and Hang Li published 'A neural-symbolic approach to natural language understanding' on arXiv in 2022.

referenceYukun Huang, Yanda Chen, Zhou Yu, and Kathleen McKeown published 'In-context learning distillation: Transferring few-shot learning ability of pre-trained language models' as an arXiv preprint (arXiv:2212.10670) in 2022.

referenceZenan Li, Zhi Zhou, Yuan Yao, Yu-Feng Li, Chun Cao, Fan Yang, Xian Zhang, and Xiaoxing Ma authored 'Neuro-symbolic data generation for math reasoning', published as an arXiv preprint (arXiv:2412.04857) in 2024.

referenceGary Marcus published the preprint 'Deep learning: A critical appraisal' on arXiv in 2018.

referenceLuís C. Lamb, Artur Garcez, Marco Gori, Marcelo Prates, Pedro Avelar, and Moshe Vardi published 'Graph neural networks meet neural-symbolic computing: A survey and perspective' on arXiv in 2020.

referenceMiguel Angel Mendez-Lucero, Enrique Bojorquez Gallardo, and Vaishak Belle authored 'Semantic objective functions: A distribution-aware method for adding logical constraints in deep learning', published as an arXiv preprint (arXiv:2405.15789) in 2024.

referenceDou Hu, Lingwei Wei, and Xiaoyong Huai developed 'DialogueCRN', a contextual reasoning network for emotion recognition in conversations, published as an arXiv preprint in 2021.

referenceCanran Xu and Ruijiang Li authored the paper 'Relation embedding with dihedral group in knowledge graph', published as an arXiv preprint in 2019.

referenceJosef Dai, Xuehai Pan, Ruiyang Sun, Jiaming Ji, Xinbo Xu, Mickel Liu, Yizhou Wang, and Yaodong Yang authored 'Safe rlhf: Safe reinforcement learning from human feedback', published as an arXiv preprint in 2023.

A Survey on the Theory and Mechanism of Large Language Models arxiv.org arXiv Mar 12, 2026 15 facts

referenceThe paper 'Language modeling is compression' is an arXiv preprint (arXiv:2309.10668).

referenceThe paper 'Transformers are ssms: generalized models and efficient algorithms through structured state space duality' is an arXiv preprint (arXiv:2405.21060).

referenceThe paper 'In-context learning with transformers: softmax attention adapts to function lipschitzness' is an arXiv preprint (arXiv:2402.11639) regarding in-context learning.

referenceThe paper 'Towards reasoning era: a survey of long chain-of-thought for reasoning large language models' is an arXiv preprint, identified as arXiv:2503.09567.

referenceThe paper 'Universal transformers' is an arXiv preprint (arXiv:1807.03819).

referenceThe paper 'CausalLM is not optimal for in-context learning' is an arXiv preprint, identified as arXiv:2308.06912.

referenceThe paper 'Provably robust watermarks for open-source language models' is an arXiv preprint (arXiv:2410.18861) cited in the context of language model security.

referenceThe paper 'Memorization or interpolation? detecting llm memorization through input perturbation analysis' is an arXiv preprint, identified as arXiv:2505.03019.

referenceThe paper 'Rethinking attention with performers' is an arXiv preprint (arXiv:2009.14794) cited in the context of attention mechanisms in large language models.

referenceThe paper 'Muon optimizes under spectral norm constraints' is an arXiv preprint, identified as arXiv:2506.15054.

referenceThe paper 'Transformers implement functional gradient descent to learn non-linear functions in context' is an arXiv preprint, identified as arXiv:2312.06528.

referenceThe paper 'A survey on data contamination for large language models' is an arXiv preprint, identified as arXiv:2502.14425.

referenceThe paper 'A survey for in-context learning' is an arXiv preprint, identified as arXiv:2301.00234.

referenceThe paper 'Revisiting chain-of-thought prompting: zero-shot can be stronger than few-shot' is an arXiv preprint, identified as arXiv:2506.14641.

referenceThe paper 'Theoretical limitations of multi-layer transformer' is an arXiv preprint, identified as arXiv:2412.02975.

Understanding LLM Understanding skywritingspress.ca Skywritings Press Jun 14, 2024 9 facts

referenceSalvatori, T., Mali, A., Buckley, C. L., Lukasiewicz, T., Rao, R. P., Friston, K., & Ororbia, A. (2023) published 'Brain-inspired computational intelligence via predictive coding' as an arXiv preprint (arXiv:2308.07870).

referenceFutrell, R. & Hahn, M. (2024) Linguistic Structure from a Bottleneck on Sequential Information Processing. arXiv preprint arXiv:2405.12109.

referenceMillhouse, T., Moses, M., & Mitchell, M. (2022). 'Embodied, Situated, and Grounded Intelligence: Implications for AI.' arXiv preprint arXiv:2210.13589.

referenceFriston, K. J., Da Costa, L., Tschantz, A., Kiefer, A., Salvatori, T., Neacsu., Neacsu, V., & Buckley, C. L. (2023) published 'Supervised structure learning' as an arXiv preprint (arXiv:2311.10300).

referenceMerullo, Jack, Carsten Eickhoff, and Ellie Pavlick. “Language Models Implement Simple Word2Vec-style Vector Arithmetic.” arXiv preprint arXiv:2305.16130 (2023).

referencePeriti, F., Cassotti, P., Dubossarsky, H., & Tahmasebi, N. (2024) authored the paper 'Analyzing Semantic Change through Lexical Replacements', published as an arXiv preprint (arXiv:2404.18570).

referenceLepori, Michael A., Thomas Serre, and Ellie Pavlick. “Break it down: evidence for structural compositionality in neural networks.” arXiv preprint arXiv:2301.10884 (2023). https://arxiv.org/pdf/2301.10884.pdf

referenceKallini, J., Papadimitriou, I., Futrell, R., Mahowald, K., & Potts, C. (2024). Mission: Impossible language models. arXiv preprint arXiv:2401.06416.

referenceBai, Y., Geng, X., Mangalam, K., Bar, A., Yuille, A., Darrell, T., Malik, J. and Efros, A.A. (2023) authored the paper 'Sequential modeling enables scalable learning for large vision models', published as an arXiv preprint (arXiv:2312.00785).

The Synergy of Symbolic and Connectionist AI in LLM-Empowered ... arxiv.org arXiv Jul 11, 2024 3 facts

referenceJacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova introduced the BERT model for deep bidirectional transformer-based language understanding in a 2018 arXiv preprint.

referenceJosh Achiam et al. published the GPT-4 technical report as an arXiv preprint in 2023.

referenceZhiheng Xi et al. published 'The rise and potential of large language model based agents: A survey' as an arXiv preprint (arXiv:2309.07864) in 2023.

KG-IRAG: A Knowledge Graph-Based Iterative Retrieval-Augmented ... arxiv.org arXiv Mar 18, 2025 2 facts

referenceHu et al. (2024) published 'Grag: Graph retrieval-augmented generation' in arXiv preprint arXiv:2405.16506, which proposes a graph-based retrieval-augmented generation framework.

referenceGao et al. (2023) published 'Retrieval-augmented generation for large language models: A survey' in arXiv preprint arXiv:2312.10997, providing a survey on RAG techniques for LLMs.

Building Trustworthy NeuroSymbolic AI Systems - arXiv arxiv.org arXiv 2 facts

referenceZhang et al. (2023) authored the paper titled 'Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models', published as arXiv:2309.01219.

referenceChen et al. (2023) authored 'PURR: Efficiently Editing Language Model Hallucinations by Denoising Language Model Corruptions', published as arXiv preprint arXiv:2305.14908.

Re-evaluating Hallucination Detection in LLMs - arXiv arxiv.org arXiv Aug 13, 2025 1 fact

referenceThe paper 'How Language Model Hallucinations Can Snowball' by Muru Zhang, Ofir Press, William Merrill, Alisa Liu, and Noah A. Smith was published as ArXiv:2305.13534.

Knowledge Graphs: Opportunities and Challenges - Springer Nature link.springer.com Springer Apr 3, 2023 1 fact

claimYao L, Mao C, Luo Y published the paper 'Kg-bert: Bert for knowledge graph completion' as an arXiv preprint in 2019.

The Hallucinations Leaderboard, an Open Effort to Measure ... huggingface.co Hugging Face Jan 29, 2024 1 fact

referenceThe Hallucinations Leaderboard project team released a paper titled 'The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models', which is available on arXiv.

Empowering GraphRAG with Knowledge Filtering and Integration arxiv.org arXiv Mar 18, 2025 1 fact

referenceHaoyu Han, Yu Wang, Harry Shomer, Kai Guo, Jiayuan Ding, Yongjia Lei, Mahantesh Halappanavar, Ryan A Rossi, Subhabrata Mukherjee, Xianfeng Tang, et al. authored 'Retrieval-augmented generation with graphs (graphrag)', published as an arXiv preprint (arXiv:2501.00309).