If you're looking for a place to search and compare large language models based on their performance and attributes, LLM Explorer is the most complete option. It's got a gargantuan catalog of more than 35,000 open-source models, filtered by attributes like size, benchmark scores and memory usage. The site offers categorized lists, benchmarks, graphs and detailed model descriptions so AI enthusiasts and pros can find the right models for their needs.
Another option is Airtrain AI, which offers an LLM Playground to try out more than 27 models, including both open-source and proprietary ones. It also has a Dataset Explorer for visualizing and clustering data and AI Scoring to test models based on your own task descriptions. With free and paid options, Airtrain AI is designed to make large language models more accessible and less expensive for quick deployment.
For those who want to oversee and optimize LLM app development, Humanloop offers a collaborative playground for developers and product managers. It includes tools to manage prompts, evaluate results and monitor progress and integrates with several LLM providers. The site supports multiple programming languages and is designed to improve productivity and collaboration for AI feature development.
Finally, HoneyHive offers a single environment for collaboration, testing and evaluation of LLM apps. With automated CI testing, observability and prompt management, it's good for a variety of use cases from debugging to data analysis. The site supports more than 100 models through integrations with popular GPU clouds and offers flexible pricing options for individuals and enterprises.