Autonomous AI agents that perform tasks independently
NKAI-Decision-Team
LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmind's PySC2 Learning Environment API as a Python LLM Environment.
...morewladpaiva
Multi-Agent Conversation Framework in TypeScript
fuxiAIlab
CivAgent is an LLM-based Human-like Agent acting as a Digital Player within the Strategy Game Unciv.
microxxx
langchain 工具,流程设计组件,服务,代理以及相关学习文档的合集(agent,service,tutorials,flow-design)
open-compass
[NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents
wshi83
[EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records
SALT-NLP
Framework and toolkits for building and evaluating collaborative agents that can work together with humans.
ConcoLLMic
ConcoLLMic: the first language- and theory-agonistic concolic execution engine via LLM agents
kaymen99
AI tool for automating Upwork job applications using AI agents to find and qualify jobs, write personalized cover letters, and prepare for interviews based on your skills and experience.
...moreAmanPriyanshu
A curated list of tools, papers, and datasets for applying AI to cybersecurity tasks. This list primarily focuses on modern AI technologies like Large Language Models (LLMs), Agents, and Multi-Modal systems and their applications in security operations.
...moreNygenAnalytics
Multi-agent LLM driven cell type annotation for single-cell RNA-Seq data
nju-websoft
A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement, ASE 2024 (Distinguished Paper Award)
...morejlin816
DialOp: Decision-oriented dialogue environments for collaborative language agents
CrawlScript
Ultra-Lightweight, Pure Python Multimodal Agent.
llm-platform-security
An Execution Isolation Architecture for LLM-Based Agentic Systems
PKU-Alignment
AAAI24(Oral) ProAgent: Building Proactive Cooperative Agents with Large Language Models
mbzuai-oryx
[CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
aws-samples
An agent based LLM assistant that extends RAG with batch entity extraction and SQL querying to improve performance on multi-step and analytical questions.
...morepromptdesk
Promptdesk is a tool designed for effectively creating, organizing, and evaluating prompts and large language models (LLMs).
...moredeep-symbolic-mathematics
[ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models
oripress
AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each problem, and is faster than existing implementations.
...moremengysun
A simple yet versatile context engineered for scalable online data collection
yecchen
Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"
YuanchenBei
[Up-to-date] A curated list of resources on graph-empowered agents and agent-facilitated graph learning (Graphs Meet Agents).
...more