Agents
2,269Autonomous AI agents that perform tasks independently
It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.
Bedrock Knowledge Base and Agents for Retrieval Augmented Generation (RAG)
[🏆 CHI26 Best Paper] CoBRA: Reproducible control of LLM agent behavior via classic social science experiments
This repo covers LLM, Agents, MCP Tools, Skills concepts with sample codes: LangChain & LangGraph, AWS Strands Agents, Google Agent Development Kit, Fundamentals.
[WWW '25 Oral - GenMentor] Official code of our paper "LLM-powered Multi-agent Framework for Goal-oriented Learning in Intelligent Tutoring System", accepted by WWW 2025 (Industry Track) as an Oral Presentation.
Mathematical benchmark exposing the massive performance gap between real agents and LLM wrappers. Rigorous multi-dimensional evaluation with statistical validation (95% CI, Cohen's h) and reproducible methodology. Separates architectural theater from real systems through stress testing, network resilience, and failure analysis.
A cross-platform desktop client supporting multiple LLM providers, integrated with AI search, developer tools, and third-party AI tool access.一款支持多种大模型(LLM)提供商的跨平台桌面客户端。同时集成AI 搜索、开发者工具集与第三方 AI 工具入口。
Hierarchical Expert Prompt for Large-Language-Models: An Approch Defeat Elite AI in TextStarCraft-II for the First Time
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
[NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos
Browser based Interface for Generative AI. Chat/Agent/Taskmanager Hybrid.
LLM Agent that leverages cheminformatics tools to provide informed responses.
ALICE and its prior work, Voice2Action: Language Models as Agent for Efficient Real-Time Interaction in Virtual Reality
LLM Agent paired with Image Captioning and Yolov8 models plays God of War
AI Lawyer is an intelligent reasoning legal assistant powered by DeepSeek , Ollama RAG and LangChain, designed to streamline legal research and document analysis. By leveraging retrieval-augmented generation (RAG), it provides precise legal insights, and contract summarization. With an intuitive Streamlit-based UI, analyze legal documents.
tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.
OrcaLoca: An LLM Agent Framework for Software Issue Localization [ICML 25]
HealthFlow: A Self-Evolving AI Agent with Meta Planning for Autonomous Healthcare Research