Stars: 421
Forks: 111
Language: Python
Last Updated: Apr 22, 2024
Similar Repos
Language | Stars | Description | Updated At | |
---|---|---|---|---|
Python | 5 | Biobank: large scale biomedical computation on OMERO | May 28, 2022 | |
Python | 24 | Large scale unannotated Korean corpus for unsupervised tasks. (e.g. Language modeling) | Jun 21, 2022 | |
Python | 3 | Tools and training scripts for large language models | Mar 28, 2023 | |
HTML | 4 | Data Management in large-scale education research training series | Feb 04, 2023 | |
None | 15 | Data from the publication "Multi-Domain Goal-Oriented Dialogues (MultiDoGO): Strategies toward Curating and Annotating Large Scale … | Apr 19, 2022 | |
Python | 2 | Paddle Large Scale Classification Tools | Jul 18, 2022 | |
Jupyter Notebook | 8 | BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance. | May 28, 2023 | |
None | 112 | Collection of training data management explorations for large language models | Jan 18, 2024 | |
Python | 2 | Tools for converting .mid files into text for training large language models | May 22, 2023 | |
Python | 2 | Components for training large language models | Nov 22, 2022 | |
Python | 68 | Evaluation suite for large-scale language models. | Jul 07, 2022 | |
Python | 2 | Some tools for large-scale video processing | May 29, 2023 | |
Python | 5 | analysis tools for large-scale neural recordings | Jul 15, 2023 | |
C++ | 849 | Scalable, fast, and lightweight system for large-scale topic modeling | Jun 12, 2022 | |
Python | 5 | PyTorch extensions for high performance and large scale training. | Jul 24, 2021 | |
Python | 1842 | PyTorch extensions for high performance and large scale training. | Aug 19, 2022 | |
Python | 102 | Galileo library for large scale graph training by JD | Jun 14, 2022 | |
None | 2 | PyTorch extensions for high performance and large scale training. | Sep 22, 2021 | |
None | 3 | EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data" | May 27, 2022 | |
Python | 1729 | The RedPajama-Data repository contains code for preparing large datasets for training large language models. | Apr 24, 2023 | |
Python | 3 | tools for analysis of diffraction images and large-scale image data processing | Mar 27, 2020 | |
C++ | 4 | Large-scale performance evaluation of knowledge graphs embeddings in the biomedical domain | May 23, 2022 | |
Python | 92 | Biomedical LLM, A Bilingual (Chinese and English) Fine-Tuned Large Language Model for Diverse Biomedical Tasks | Jan 17, 2024 | |
Python | 27 | DeepGNN is a framework for training machine learning models on large scale graph data. | Aug 05, 2022 | |
None | 2 | NCATS Biomedical Data Translator repository for shared, large file access | Jun 11, 2023 | |
Python | 872 | Evolutionary Scale Modeling (esm): Pretrained language models for proteins | Aug 10, 2022 | |
Python | 1044 | Unsupervised Language Modeling at scale for robust sentiment classification | Jun 30, 2022 | |
Python | 5 | Unsupervised Language Modeling at scale for robust sentiment classification | Aug 14, 2022 | |
Python | 2 | Evolutionary Scale Modeling (esm): Pretrained language models for proteins | Jan 20, 2024 | |
Python | 21 | Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT … | Jan 27, 2023 | |
Python | 110 | Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER … | Mar 16, 2023 | |
Python | 4 | large language model training and deploy | Apr 25, 2023 | |
Java | 3 | Hadoop library for large-scale data processing | Feb 17, 2021 | |
Scala | 4 | Training Large Scale Statistical Machine Translation Models on Spark | Aug 18, 2019 | |
Python | 3 | Training Large-scale Text Embedding Models with 🤗 Transformers | Aug 26, 2023 | |
Vue | 3 | Web platform for easily managing, curating, and sharing FAIR and AI-ready clinical and biomedical research … | Feb 28, 2023 | |
Java | 16 | An object-oriented language for modeling large-scale neural systems, along with an IDE for writing and … | May 24, 2023 | |
Shell | 7 | Large-scale Data Analysis supplementary material. | Apr 05, 2023 | |
Python | 260 | BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training | Aug 31, 2022 | |
Python | 418 | Open-AI's DALL-E for large scale training in mesh-tensorflow. | Aug 03, 2022 | |
None | 2 | Open-AI's DALL-E for large scale training in mesh-tensorflow. | Mar 15, 2023 | |
Python | 20 | EasyRobust: An Easy-to-use Framework for Large-scale Robust Training | Aug 12, 2022 | |
Jupyter Notebook | 38 | Input pipelines for large scale, sharded training of deep learning models. | Sep 10, 2021 | |
Python | 183 | LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training | Aug 03, 2022 | |
Python | 111 | Voxel-MAE: Masked Autoencoders for Pre-training Large-scale Point Clouds | Oct 19, 2022 | |
Python | 660 | 📖 Some language modeling tools for Keras | Jan 28, 2023 | |
None | 5 | A Data Modeling Language | Apr 27, 2022 | |
CSS | 61 | Lectures for INFO8002 - Large-scale Data Systems, ULiège | Mar 06, 2023 | |
C++ | 13 | Aligner for large scale serial section image data | Jan 18, 2023 | |
Python | 34 | large language model training-3-stages+deployment | Dec 11, 2023 |