Search
tokenspeed-trtllm-attn
Name reserved for the tokenspeed-trtllm-attn project.
llm-contextlens
Compress your local LLM KV cache with 5.3× memory reduction
tokenspeed-trtllm-common
Name reserved for the tokenspeed-trtllm-common project.
tokenspeed-trtllm-gemm
Name reserved for the tokenspeed-trtllm-gemm project.
alchemylab
alchemy_lab
AlchemyLab - agentic coding IDEm
sw-metadata-bot
Metadata quality bot for software repositories, leveraging metacheck for analysis and GitHub/GitLab APIs for issue management.
...morepynlqe
Natural language to SQL query engine powered by LangChain and DuckDB
vllm-factory
The LEGO set for custom vLLM model plugins — build, test, and deploy custom encoders, poolers, and kernels
mlx-serve
mlx-serve contributors
Local inference server for Apple Silicon that hot-swaps MLX models (LLM, vision, embeddings, TTS, STT) via OpenAI-compatible API
...moreaugllm
This is LLM interface library.
onecomp
Keiji Kimura
Python package for LLM compression
sandstrike
A comprehensive Python library and CLI tool to perform LLM Red Teaming with Avenlis SandStrike.
frontal-ai
Frontal Labs
AI service client for the Frontal Python SDK — text, embeddings, images, speech, video, and more
frontal-blob
Frontal Labs
Blob storage client for the Frontal Python SDK
preambulate
Graph-based project memory for Claude Code — semantic density over transcript volume
msaas-rag
RAG pipeline library — chunking, embeddings, vector search, and retrieval for the Willian SaaS platform
msaas-audit-log
Immutable audit log library with PostgreSQL storage for the Willian SaaS platform
ghp
Stateless GitHub activity summary — compact, LLM-friendly output
search-claude-history
Search across Claude Code session history
validedi
A modern, configuration-driven X12 EDI parser and validator for healthcare transactions with optional LLM-powered explanations
...more