full stack deep learning - llm bootcamp: https://fullstackdeeplearning.com/llm-bootcamp/spring-2023/
- prototyping
- nat.dev
- https://play.vercel.ai/
- prompt engineering
- https://github.com/ianarawjo/ChainForge An open-source visual programming environment for LLM experimentation and prompt evaluation.
- alternatives
- https://github.com/logspace-ai/langflow a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
- https://github.com/FlowiseAI/Flowise - visual langchain builder
- vellum.ai has a visual flow editor thing. tools for prompt engineering, semantic search, version control, quantitative testing, and performance monitoring.
- https://github.com/ianarawjo/ChainForge An open-source visual programming environment for LLM experimentation and prompt evaluation.
- owning the endpoint
- OpenLM - https://github.com/r2d4/openlm OpenAI-compatible Python client that can call any LLM
- SDK wrappers
- prompt tooling
- langchain
- llamaindex
- deepset haystack
- guardrails
- scale spellbook
- vector databases
- portkey https://twitter.com/jumbld/status/1648684887988117508?s=46&t=90xQ8sGy63D2OtiaoGJuww
- helicone
- Ozone - prompt unit testing https://twitter.com/at_sushi_/status/1667004844153131008
- https://log10.io/ - pivoting to llm quality monitoring
- eval
- https://github.com/BerriAI/bettertest https://twitter.com/ishaan_jaff/status/1665105582804832258
- https://github.com/AgentOps-AI/agentops
- Baserun.ai
- https://hegel-ai.com, https://www.vellum.ai/, https://www.parea.ai, http://baserun.ai, https://www.trychatter.ai, https://talc.ai, https://github.com/BerriAI/bettertest, https://langfuse.com
- https://github.com/mr-gpt/deepeval
- Hegel AI Prompttools
- https://github.com/promptfoo/promptfoo
- https://benchllm.com/ - https://www.getscorecard.ai - https://arxiv.org/abs/2308.03688 - https://withmartian.com - https://aihero.studio/
- "LLM observability": Baserun, Athina, LangSmith, Parea, Arize, Langfuse
- evals
- data quality
- cleanlab.ai
- deepchecks <- bigger
- lilac ai
- gallileo
- https://github.com/BerriAI/litellm
- martian
- openrouter
- Microsoft TypeChat https://news.ycombinator.com/item?id=36803124
- jsonformer
- lmql
- hugginface transformers https://huggingface.co/learn/nlp-course/chapter0/1?fw=pt
- lightning https://twitter.com/_willfalcon/status/1665826619200614401
- vLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention (github) - PagedAttention is super efficient for production workloads and they do a great job with dynamic batching, queuing requests, etc.
- Skypilot https://github.com/skypilot-org/skypilot a framework for running LLMs, AI, and batch jobs on any cloud, offering maximum cost savings, highest GPU availability, and managed execution.
SkyPilot abstracts away cloud infra burdens:
- Launch jobs & clusters on any cloud
- Easy scale-out: queue and run many jobs, automatically managed
- Easy access to object stores (S3, GCS, R2)
comparisons https://thedataquarry.com/posts/vector-db-1/
https://news.ycombinator.com/item?id=36943318
- chroma
- pinecone
- weaviate
- qdrant
- marqo vector search
- postgres
- supabase vector
- problems with it:
https://twitter.com/nirantk/status/1674110063286571008?s=46
https://nextword.substack.com/p/vector-database-is-not-a-separate
- Cloudflare launches vectorize, announced on September 27th, 2023
- MongoDB Atlas Vector Search launched on June 22nd, 2023
- Databricks announced on June 28th, 2023
- Oracle integrated vector database announced on September 19th, 2023
- IBM to announce vector database preview in Q4 2023
- of course, companies such as Elastic and Microsoft already had vector DB offerings much earlier.
ETL
- psychic.dev
fully vertically integrated RAG cloud
- vectara -29m raised and from former cloudera founder
- https://pezzo.ai - "enables you to build, test, monitor and instantly ship AI all in one platform, while constantly optimizing for cost and performance." - used by Meltwater CTO - from shack15
- https://www.pulze.ai maybe?
- https://mlfoundry.com/
- together.ai
- model hosting and finetuning
- LLM Engine (https://llm-engine.scale.com) at Scale, which is our open source, self-hostable framework for open source LLM inference and fine-tuning. (source)
- replicate
- https://github.com/danielgross/localpilot
- https://github.com/continuedev/continue
- https://github.com/mudler/LocalAI
- https://vxtwitter.com/ex3ndr/status/1726863029919482167
- AI relational database https://github.com/georgia-tech-db/eva
- finetune industry
- AI devtools
- codegen.ai
- safety/security
- openai moderation endpoint (free)
- meta llamaguard/purple llama https://arxiv.org/abs/2312.06674