Stars: 2060
Forks: 203
Language: Python
Last Updated: Dec 08, 2023
Similar Repos
Language | Stars | Description | Updated At |
---|---|---|---|
None | 2 | Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, … | May 11, 2023 | |
None | 2 | Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, … | Jun 29, 2023 | |
Python | 37 | Pre-training script for BART in JAX/Flax | Aug 11, 2022 | |
Python | 3 | 🔐 Serialize JAX/Flax models with `safetensors` | Dec 24, 2022 | |
JavaScript | 2 | recursive dataset gen for finetuning pre-trained GPT models from large text | May 21, 2023 | |
None | 4 | An open platform for training, serving, and evaluating large language models. Release repo for Vicuna … | Oct 10, 2023 | |
Python | 25 | Shared code for training sentence embeddings with Flax / JAX | Feb 14, 2023 | |
None | 2 | An open platform for training, serving, and evaluating large language model based chatbots. | Apr 29, 2023 | |
None | 5 | Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal | Mar 27, 2023 | |
Python | 2 | An open platform for training, serving, and evaluating large language model for tool learning. | May 28, 2023 | |
Python | 107 | Finetuning large language models for GDScript generation. | Apr 24, 2023 | |
Python | 150 | Pretrained models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc. | Aug 19, 2022 | |
None | 3 | Large Language Models(LLMs) of Code | Apr 24, 2023 | |
Python | 20 | Pre-training BART in Flax on The Pile dataset | Jan 28, 2023 | |
Python | 3 | Implementation of Denoising Diffusion Probabilistic Models (DDPM) in JAX and Flax. | Aug 30, 2023 | |
Python | 585 | Notebooks for Large Language Models (LLMs) Specialization | Jan 17, 2024 | |
Python | 116 | Train very large language models in Jax. | Jan 23, 2023 | |
None | 2 | Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. | Jan 18, 2023 | |
Jupyter Notebook | 15 | Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. | Jun 05, 2022 | |
Python | 2 | Distributed Decentralized Platform for Training, Finetuning, and Utilizing AI Models! | Sep 01, 2023 | |
Python | 3 | Training, testing, and evaluating machine learning classifier models | Apr 10, 2018 | |
Jupyter Notebook | 2 | training and evaluating ml models on sleep data | Dec 11, 2023 | |
Jupyter Notebook | 6 | Normalizing flow models allowing for a conditioning context, implemented using Jax, Flax, and Distrax. | Jan 21, 2023 | |
Python | 1060 | A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models | Oct 16, 2022 | |
Python | 3 | A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models | Apr 20, 2021 | |
None | 2 | A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models | Nov 05, 2021 | |
None | 2 | A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models | Nov 15, 2023 | |
None | 2 | Pre-Training of German T5 models | Aug 29, 2023 | |
Python | 32 | 4-Bit Finetuning of Large Language Models on One Consumer GPU | May 12, 2023 | |
Jupyter Notebook | 14 | A framework for creating grounded instruction based datasets and training conversational domain expert Large Language … | May 10, 2023 | |
Python | 2 | Components for training large language models | Nov 22, 2022 | |
Python | 4 | Collection of healthcare-inspired prompts for Large Language Models (LLMs) | Apr 26, 2023 | |
Ballerina | 2 | Ballerina ReAct type Agent module using Large language models (LLMs) | Jun 02, 2023 | |
JavaScript | 115 | Run Large-Language Models (LLMs) 🚀 directly in your browser! | Jan 16, 2024 | |
HTML | 19 | Qualitative data coding and analysis using Large Language Models (LLMs) | Dec 31, 2023 | |
Jupyter Notebook | 5 | Cookiecutter for evaluating algorithms and training models in nussl | Aug 09, 2022 | |
Python | 2 | Code for pre-training BabyLM baseline models. | May 28, 2023 | |
Python | 196 | TensorFlow framework for training and serving machine learning models | Mar 15, 2022 | |
Jupyter Notebook | 17 | Cryptocurrency forecasting 📈 training and serving models made automatic | Apr 20, 2023 | |
PDDL | 143 | An extensible benchmark for evaluating large language models on planning | Jan 17, 2024 | |
Python | 3728 | An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Language Model for … | Apr 09, 2023 | |
Python | 12 | A solution for on-demand training and serving of Machine Learning models, using Azure Databricks and … | Dec 20, 2020 | |
Python | 2 | A solution for on-demand training and serving of Machine Learning models, using Azure Databricks and … | Jul 22, 2019 | |
Python | 188 | EVA: Large-scale Pre-trained Chit-Chat Models | Oct 16, 2022 | |
Python | 10 | Question-answering on your own data with Large Language Models (LLMs) | Mar 26, 2023 | |
Python | 48 | A framework to empower quantitative modeling using Large Language Models (LLMs) | May 24, 2023 | |
Python | 8 | Leverage hallucinations from Large Language Models (LLMs) for novelty-driven explorations. | May 16, 2023 | |
Python | 786 | A generalized information-seeking agent system with Large Language Models (LLMs). | Jan 18, 2024 | |
Python | 88 | Efficient Training (including pre-training and fine-tuning) for Big Models | Aug 12, 2022 | |
Python | 352 | Code for the paper "Evaluating Large Language Models Trained on Code" | Aug 11, 2022 |