Showing 28 of 28 projects
A comprehensive open-source guide covering prompt engineering techniques, papers, notebooks, and resources for LLMs, RAG, and AI agents.
TensorFlow implementation and pre-trained models for BERT, a bidirectional Transformer for language understanding.
A TypeScript toolkit for building AI-powered applications and agents with React, Next.js, and Node.js support.
A TypeScript toolkit for building AI-powered applications and agents with React, Next.js, and Node.js support.
Open-source AI platform for building private agents, assistants, and enterprise search with document analysis and multi-model support.
An all-in-one AI framework for semantic search, LLM orchestration, and language model workflows built around an embeddings database.
Fast, state-of-the-art tokenizers for training and tokenization, optimized for both research and production.
An open platform for deploying and using language agents for data analysis, plugin automation, and web browsing.
A Go CLI tool that creates a Telegram bot to interact with ChatGPT, deployable with a single command.
An open-source, locally-runnable code completion engine using large language models that works on CPU.
A JAX/Flax-based framework for easy and scalable pre-training, fine-tuning, evaluation, and serving of large language models.
An experimental toolkit that automatically generates and maintains codebase documentation using LLMs like GPT-4.
A BERT language model pre-trained on a large corpus of scientific papers for natural language processing tasks in scientific domains.
A GPT-2 variant that generates plausible fake words, definitions, and usage examples from scratch.
TensorFlow implementation of character-aware neural language models using CNN, highway networks, and LSTM.
A benchmark for evaluating protein language models through five biologically relevant semi-supervised learning tasks.
Open-source language models and tools for protein engineering and design using AI.
A BERT model pre-trained on PubMed abstracts and clinical notes for biomedical natural language processing tasks.
A foundation model for multi-species genome understanding, achieving state-of-the-art performance on 28 genomic tasks.
A BERT-based language model pretrained on clinical notes for predicting hospital readmissions and analyzing medical text.
A collection of genomic language models for predicting variant effects and evolutionary constraints from DNA sequences.
A Python library that simplifies using, finetuning, and deploying state-of-the-art machine learning models for various AI tasks.
A pure Go library for fast, offline natural language detection supporting 29 languages.
A collection of pre-trained BERT, DistilBERT, ELECTRA, GPT-2, and ConvBERT models for multiple languages, including German, Italian, Turkish, and historic texts.
Wrap the Gemini CLI as an OpenAI-compatible API service to use the free Gemini Pro model via standard API calls.
A GPT-2 model trained from scratch on password leaks for password modeling, generation, and strength estimation.
A GPT-2 style transformer language model implemented from scratch in Rust for educational purposes.
A community provider for the Vercel AI SDK that enables using Google's Gemini models through the Gemini CLI and Google Cloud Code endpoints.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.