back to home

KalyanKS-NLP / llm-engineer-toolkit

A curated list of 120+ LLM libraries category wise.

9,824 stars
1,579 forks
16 issues

AI Architecture Analysis

This repository is indexed by RepoMind. By analyzing KalyanKS-NLP/llm-engineer-toolkit in our AI interface, you can instantly generate complete architecture diagrams, visualize control flows, and perform automated security audits across the entire codebase.

Our Agentic Context Augmented Generation (Agentic CAG) engine loads full source files into context, avoiding the fragmentation of traditional RAG systems. Ask questions about the architecture, dependencies, or specific features to see it in action.

Embed this Badge

Showcase RepoMind's analysis directly in your repository's README.

[![Analyzed by RepoMind](https://img.shields.io/badge/Analyzed%20by-RepoMind-4F46E5?style=for-the-badge)](https://repomind-ai.vercel.app/repo/KalyanKS-NLP/llm-engineer-toolkit)
Preview:Analyzed by RepoMind

Repository Summary (README)

Preview

👨🏻‍💻 LLM Engineer Toolkit

This repository contains a curated list of 120+ LLM libraries category wise.

<p align="center"> <a href="https://www.linkedin.com/in/kalyanksnlp/"> <img src="https://custom-icon-badges.demolab.com/badge/Kalyan%20KS-0A66C2?logo=linkedin-white&logoColor=fff" alt="LinkedIn"> </a> <a href="https://x.com/kalyan_kpl"> <img src="https://img.shields.io/badge/Kalyan%20KS-%23000000.svg?logo=X&logoColor=white" alt="Twitter"> </a> <a href="https://www.youtube.com/@kalyanksnlp"> <img src="https://img.shields.io/badge/Kalyan%20KS-%23FF0000.svg?logo=YouTube&logoColor=white" alt="Twitter"> </a> </p>

🚀 LLM Interview Questions and Answers Book

Crack modern LLM and Generative AI interviews with this comprehensive, interview-focused guide designed specifically for ML Engineers, AI Engineers, Data Scientists and Software Engineers.

This book features 100+ carefully curated LLM interview questions, each paired with clear answers and in-depth explanations so you truly understand the concepts interviewers care about. Get the book here.

Use the Coupon Code: LLMQA25 for an exclusive discount (50%) on the book. (Available only for a short period of time).

LLM Interview Questions and Answers Book by Kalyan KS

Related Repositories

Stay Updated with Generative AI, LLMs, Agents and RAG.

Join 🚀 AIxFunda free newsletter to get latest updates and interesting tutorials related to Generative AI, LLMs, Agents and RAG.

  • ✨ Weekly GenAI updates
  • 📄 Weekly LLM, Agents and RAG paper updates
  • 📝 1 fresh blog post on an interesting topic every week

Quick links

🚀 LLM Training🧱 LLM Application Development🩸LLM RAG
🟩 LLM Inference🚧 LLM Serving📤 LLM Data Extraction
🌠 LLM Data Generation💎 LLM Agents⚖️ LLM Evaluation
🔍 LLM Monitoring📅 LLM Prompts📝 LLM Structured Outputs
🛑 LLM Safety and Security💠 LLM Embedding Models❇️ Others

LLM Training and Fine-Tuning

LibraryDescriptionLink
unslothFine-tune LLMs faster with less memory.Link
PEFTState-of-the-art Parameter-Efficient Fine-Tuning library.Link
TRLTrain transformer language models with reinforcement learning.Link
TransformersTransformers provides thousands of pretrained models to perform tasks on different modalities such as text, vision, and audio.Link
AxolotlTool designed to streamline post-training for various AI models.Link
LLMBoxA comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.Link
LitGPTTrain and fine-tune LLM lightning fast.Link
MergooA library for easily merging multiple LLM experts, and efficiently train the merged LLM.Link
Llama-FactoryEasy and efficient LLM fine-tuning.Link
LudwigLow-code framework for building custom LLMs, neural networks, and other AI models.Link
TxtinstructA framework for training instruction-tuned models.Link
LaminiAn integrated LLM inference and tuning platform.Link
XTuringxTuring provides fast, efficient and simple fine-tuning of open-source LLMs, such as Mistral, LLaMA, GPT-J, and more.Link
RL4LMsA modular RL library to fine-tune language models to human preferences.Link
DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.Link
torchtuneA PyTorch-native library specifically designed for fine-tuning LLMs.Link
PyTorch LightningA library that offers a high-level interface for pretraining and fine-tuning LLMs.Link

LLM Application Development

<p align = "center"> <b> Frameworks </b> </p>
LibraryDescriptionLink
LangChainLangChain is a framework for developing applications powered by large language models (LLMs).Link
Llama IndexLlamaIndex is a data framework for your LLM applications.Link
HayStackHaystack is an end-to-end LLM framework that allows you to build applications powered by LLMs, Transformer models, vector search and more.Link
Prompt flowA suite of development tools designed to streamline the end-to-end development cycle of LLM-based AI applications.Link
GriptapeA modular Python framework for building AI-powered applications.Link
WeaveWeave is a toolkit for developing Generative AI applications.Link
Llama StackBuild Llama Apps.Link
<p align = "center"> <b> Data Preparation </b> </p>
LibraryDescriptionLink
Data Prep KitData Prep Kit accelerates unstructured data preparation for LLM app developers. Developers can use Data Prep Kit to cleanse, transform, and enrich use case-specific unstructured data to pre-train LLMs, fine-tune LLMs, instruct-tune LLMs, or build RAG applications.Link
<p align = "center"> <b> Multi API Access </b> </p>
LibraryDescriptionLink
LiteLLMLibrary to call 100+ LLM APIs in OpenAI format.Link
AI GatewayA Blazing Fast AI Gateway with integrated Guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.Link
<p align = "center"> <b> Routers </b> </p>
LibraryDescriptionLink
RouteLLMFramework for serving and evaluating LLM routers - save LLM costs without compromising quality. Drop-in replacement for OpenAI's client to route simpler queries to cheaper models.Link
<p align = "center"> <b> Memory </b> </p>
LibraryDescriptionLink
mem0The Memory layer for your AI apps.Link
MemoripyAn AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.Link
Letta (MemGPT)An open-source framework for building stateful LLM applications with advanced reasoning capabilities and transparent long-term memoryLink
MemobaseA user profile-based memory system designed to bring long-term user memory to your Generative AI applications.Link
<p align = "center"> <b> Interface </b> </p>
LibraryDescriptionLink
StreamlitA faster way to build and share data apps. Streamlit lets you transform Python scripts into interactive web apps in minutesLink
GradioBuild and share delightful machine learning apps, all in Python.Link
AI SDK UIBuild chat and generative user interfaces.Link
AI-GradioCreate AI apps powered by various AI providers.Link
SimpleaichatPython package for easily interfacing with chat apps, with robust features and minimal code complexity.Link
ChainlitBuild production-ready Conversational AI applications in minutes.Link
<p align = "center"> <b> Low Code </b> </p>
LibraryDescriptionLink
LangFlowLangFlow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.Link
<p align = "center"> <b> Cache </b> </p>
LibraryDescriptionLink
GPTCacheA Library for Creating Semantic Cache for LLM Queries. Slash Your LLM API Costs by 10x 💰, Boost Speed by 100x. Fully integrated with LangChain and LlamaIndex.Link

LLM RAG

LibraryDescriptionLink
FastGraph RAGStreamlined and promptable Fast GraphRAG framework designed for interpretable, high-precision, agent-driven retrieval workflows.Link
ChonkieRAG chunking library that is lightweight, lightning-fast, and easy to use.Link
RAGCheckerA Fine-grained Framework For Diagnosing RAG.Link
RAG to RichesBuild, scale, and deploy state-of-the-art Retrieval-Augmented Generation applications.Link
BeyondLLMBeyond LLM offers an all-in-one toolkit for experimentation, evaluation, and deployment of Retrieval-Augmented Generation (RAG) systems.Link
SQLite-VecA vector search SQLite extension that runs anywhere!Link
fastRAGfastRAG is a research framework for efficient and optimized retrieval-augmented generative pipelines, incorporating state-of-the-art LLMs and Information Retrieval.Link
FlashRAGA Python Toolkit for Efficient RAG Research.Link
LlmwareUnified framework for building enterprise RAG pipelines with small, specialized models.Link
RerankersA lightweight unified API for various reranking models.Link
VectaraBuild Agentic RAG applications.Link

LLM Inference

LibraryDescriptionLink
llama.cppLLM inference in C/C++.Link
OllamaLocal LLM inference.Link
vLLMHigh-throughput and memory-efficient inference and serving engine for LLMs.Link
TensorRT-LLMTensorRT-LLM is a library for optimizing Large Language Model (LLM) inference.Link
WebLLMHigh-performance In-browser LLM Inference Engine.Link
LLM CompressorTransformers-compatible library for applying various compression algorithms to LLMs for optimized deployment.Link
LightLLMPython-based LLM inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.Link
torchchatRun PyTorch LLMs locally on servers, desktop, and mobile.Link

LLM Serving

LibraryDescriptionLink
LangcornServing LangChain LLM apps and agents automagically with FastAPI.Link
LitServeLightning-fast serving engine for any AI model of any size. It augments FastAPI with features like batching, streaming, and GPU autoscaling.Link

LLM Data Extraction

LibraryDescriptionLink
Crawl4AIOpen-source LLM Friendly Web Crawler & Scraper.Link
ScrapeGraphAIA web scraping Python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, Markdown, etc.).Link
DoclingDocling parses documents and exports them to the desired format with ease and speed.Link
Llama ParseGenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents).Link
PyMuPDF4LLMPyMuPDF4LLM library makes it easier to extract PDF content in the format you need for LLM & RAG environments.Link
CrawleeA web scraping and browser automation library.Link
MegaParseParser for every type of document.Link
ExtractThinkerDocument Intelligence library for LLMs.Link

LLM Data Generation

LibraryDescriptionLink
DataDreamerDataDreamer is a powerful open-source Python library for prompting, synthetic data generation, and training workflows.Link
fabricatorA flexible open-source framework to generate datasets with large language models.Link
PromptwrightSynthetic Dataset Generation Library.Link
EasyInstructAn Easy-to-use Instruction Processing Framework for Large Language Models.Link

LLM Agents

LibraryDescriptionLink
CrewAIFramework for orchestrating role-playing, autonomous AI agents.Link
LangGraphBuild resilient language agents as graphs.Link
AgnoBuild AI Agents with memory, knowledge, tools, and reasoning. Chat with them using a beautiful Agent UI.Link
Agents SDKBuild agentic apps using LLMs with context, tools, hand off to other specialized agents.Link
AutoGenAn open-source framework for building AI agent systems.Link
SmolagentsLibrary to build powerful agents in a few lines of code.Link
Pydantic AIPython agent framework to build production grade applications with Generative AI.Link
CAMELOpen-source multi-agent framework with various toolkits and use-cases available.Link
BeeAIBuild production-ready multi-agent systems in Python.Link
gradio-toolsA Python library for converting Gradio apps into tools that can be leveraged by an LLM-based agent to complete its task.Link
ComposioProduction Ready Toolset for AI Agents.Link
Atomic AgentsBuilding AI agents, atomically.Link
MemaryOpen Source Memory Layer For Autonomous Agents.Link
Browser UseMake websites accessible for AI agents.Link
OpenWebAgentAn Open Toolkit to Enable Web Agents on Large Language Models.Link
LagentA lightweight framework for building LLM-based agents.Link
LazyLLMA Low-code Development Tool For Building Multi-agent LLMs Applications.Link
SwarmsThe Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework.Link
ChatArenaChatArena is a library that provides multi-agent language game environments and facilitates research about autonomous LLM agents and their social interactions.Link
SwarmEducational framework exploring ergonomic, lightweight multi-agent orchestration.Link
AgentStackThe fastest way to build robust AI agents.Link
ArchgwIntelligent gateway for Agents.Link
FlowA lightweight task engine for building AI agents.Link
AgentOpsPython SDK for AI agent monitoring.Link
LangroidMulti-Agent framework.Link
AgentariumFramework for creating and managing simulations populated with AI-powered agents.Link
UpsonicReliable AI agent framework that supports MCP.Link

LLM Evaluation

LibraryDescriptionLink
RagasRagas is your ultimate toolkit for evaluating and optimizing Large Language Model (LLM) applications.Link
GiskardOpen-Source Evaluation & Testing for ML & LLM systems.Link
DeepEvalLLM Evaluation FrameworkLink
LightevalAll-in-one toolkit for evaluating LLMs.Link
TrulensEvaluation and Tracking for LLM ExperimentsLink
PromptBenchA unified evaluation framework for large language models.Link
LangTestDeliver Safe & Effective Language Models. 60+ Test Types for Comparing LLM & NLP Models on Accuracy, Bias, Fairness, Robustness & More.Link
EvalPlusA rigorous evaluation framework for LLM4Code.Link
FastChatAn open platform for training, serving, and evaluating large language model-based chatbots.Link
judgesA small library of LLM judges.Link
EvalsEvals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.Link
AgentEvalsEvaluators and utilities for evaluating the performance of your agents.Link
LLMBoxA comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.Link
OpikAn open-source end-to-end LLM Development Platform which also includes LLM evaluation.Link
PydanticAI EvalsA powerful evaluation framework designed to help you systematically evaluate the performance of LLM applications.Link
UQLMA Python package for generation-time, zero-resource LLM hallucination using state-of-the-art uncertainty quantification techniques.Link

LLM Monitoring

LibraryDescriptionLink
MLflowAn open-source end-to-end MLOps/LLMOps Platform for tracking, evaluating, and monitoring LLM applications.Link
OpikAn open-source end-to-end LLM Development Platform which also includes LLM monitoring.Link
LangSmithProvides tools for logging, monitoring, and improving your LLM applications.Link
Weights & Biases (W&B)W&B provides features for tracking LLM performance.Link
HeliconeOpen source LLM-Observability Platform for Developers. One-line integration for monitoring, metrics, evals, agent tracing, prompt management, playground, etc.Link
EvidentlyAn open-source ML and LLM observability framework.Link
PhoenixAn open-source AI observability platform designed for experimentation, evaluation, and troubleshooting.Link
ObserversA Lightweight Library for AI Observability.Link

LLM Prompts

LibraryDescriptionLink
PCToolkitA Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models.Link
Selective ContextSelective Context compresses your prompt and context to allow LLMs (such as ChatGPT) to process 2x more content.Link
LLMLinguaLibrary for compressing prompts to accelerate LLM inference.Link
betterpromptTest suite for LLM prompts before pushing them to production.Link
PromptifySolve NLP Problems with LLMs & easily generate different NLP Task prompts for popular generative models like GPT, PaLM, and more with Promptify.Link
PromptSourcePromptSource is a toolkit for creating, sharing, and using natural language prompts.Link
DSPyDSPy is the open-source framework for programming—rather than prompting—language models.Link
Py-priomptPrompt design library.Link
PromptimizerPrompt optimization library.Link

LLM Structured Outputs

LibraryDescriptionLink
InstructorPython library for working with structured outputs from large language models (LLMs). Built on top of Pydantic, it provides a simple, transparent, and user-friendly API.Link
XGrammarAn open-source library for efficient, flexible, and portable structured generation.Link
OutlinesRobust (structured) text generationLink
GuidanceGuidance is an efficient programming paradigm for steering language models.Link
LMQLA language for constraint-guided and efficient LLM programming.Link
JsonformerA Bulletproof Way to Generate Structured JSON from Language Models.Link

LLM Safety and Security

LibraryDescriptionLink
JailbreakEvalA collection of automated evaluators for assessing jailbreak attempts.Link
EasyJailbreakAn easy-to-use Python framework to generate adversarial jailbreak prompts.Link
GuardrailsAdding guardrails to large language models.Link
LLM GuardThe Security Toolkit for LLM Interactions.Link
AuditNLGAuditNLG is an open-source library that can help reduce the risks associated with using generative AI systems for language.Link
NeMo GuardrailsNeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.Link
GarakLLM vulnerability scannerLink
DeepTeamThe LLM Red Teaming FrameworkLink

LLM Embedding Models

LibraryDescriptionLink
Sentence-TransformersState-of-the-Art Text EmbeddingsLink
Model2VecFast State-of-the-Art Static EmbeddingsLink
Text Embedding InferenceA blazing fast inference solution for text embeddings models. TEI enables high-performance extraction for the most popular models, including FlagEmbedding, Ember, GTE and E5.Link

Others

LibraryDescriptionLink
Text MachinaA modular and extensible Python framework, designed to aid in the creation of high-quality, unbiased datasets to build robust models for MGT-related tasks such as detection, attribution, and boundary detection.Link
LLM ReasonersA library for advanced large language model reasoning.Link
EasyEditAn Easy-to-use Knowledge Editing Framework for Large Language Models.Link
CodeTFCodeTF: One-stop Transformer Library for State-of-the-art Code LLM.Link
spacy-llmThis package integrates Large Language Models (LLMs) into spaCy, featuring a modular system for fast prototyping and prompting, and turning unstructured responses into robust outputs for various NLP tasks.Link
pandas-aiChat with your database (SQL, CSV, pandas, polars, MongoDB, NoSQL, etc.).Link
LLM Transparency ToolAn open-source interactive toolkit for analyzing internal workings of Transformer-based language models.Link
VannaChat with your SQL database. Accurate Text-to-SQL Generation via LLMs using RAG.Link
mergekitTools for merging pretrained large language models.Link
MarkLLMAn Open-Source Toolkit for LLM Watermarking.Link
LLMSanitizeAn open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).Link
AnnotateaiAutomatically annotate papers using LLMs.Link
LLM ReasonerMake any LLM think like OpenAI o1 and DeepSeek R1.Link

⭐️ Star History

Star History Chart

Please consider giving a star, if you find this repository useful.