Stars
12
Forks
0
Language
None
Last Updated
May 04, 2023
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 68 | Evaluation suite for large-scale language models. | Jul 07, 2022 | |
Python | 20 | Evaluation of the Open-Assistant language models | Apr 28, 2023 | |
Jupyter Notebook | 2 | Google Sanskrit translation evaluation using language models | Oct 13, 2023 | |
Python | 2 | MMLU evaluation for language models served with EasyLM | Apr 12, 2023 | |
None | 2 | Portuguese Language Unraid Repo | Apr 28, 2023 | |
Python | 600 | Portuguese pre-trained BERT models | Sep 10, 2022 | |
Python | 18 | Evaluation of language models on mono- or multilingual Scandinavian language tasks. | Apr 20, 2023 | |
Python | 4 | A framework for few-shot evaluation of autoregressive language models. | Nov 21, 2021 | |
Python | 13 | A framework for few-shot evaluation of autoregressive language models. | Jul 12, 2022 | |
Python | 315 | A framework for few-shot evaluation of autoregressive language models. | Aug 10, 2022 | |
Python | 10 | A framework for few-shot evaluation of autoregressive language models. | Jan 28, 2023 | |
None | 8 | A framework for few-shot evaluation of autoregressive language models. | Apr 18, 2023 | |
Python | 2 | A framework for few-shot evaluation of autoregressive language models. | Mar 07, 2023 | |
Python | 2 | A framework for few-shot evaluation of autoregressive language models. | Jun 12, 2023 | |
Python | 5 | A framework for few-shot evaluation of autoregressive language models. | Jun 16, 2023 | |
None | 8 | Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models" | May 24, 2023 | |
R | 15 | Stemming Algorithms for the Portuguese Language | Apr 21, 2022 | |
Jupyter Notebook | 2 | Benchmark dataset and evaluation for large language models that generate code | Apr 29, 2022 | |
Python | 47 | A framework for the evaluation of autoregressive code generation language models. | Dec 01, 2022 | |
Python | 42 | Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models … | Nov 20, 2022 | |
Jupyter Notebook | 2 | Evaluation of regression models | Aug 17, 2021 | |
Python | 4 | Evaluation of finetuned models. | Feb 22, 2023 | |
Python | 3 | Some code for evaluation and a leaderboard of instruction following language models | Jun 19, 2023 | |
None | 9 | random sarcasm and jokes at portuguese language | Mar 18, 2022 | |
R | 15 | A Stemming Algorithm for the Portuguese Language | Jul 21, 2022 | |
Jupyter Notebook | 26 | Biomedical and Clinical BERT for Portuguese Language | Jul 25, 2022 | |
Jupyter Notebook | 2 | Offensive Language Identification Dataset for Brazilian Portuguese. | Oct 18, 2023 | |
Jupyter Notebook | 11 | Supporting code for the paper "Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity … | Nov 11, 2022 | |
None | 5 | an ML & deep learning algorithms/models to assess spoken English language proficiency +++ it transforms … | Jul 22, 2022 | |
R | 4 | evaluation of global climate models | Jun 29, 2022 | |
HTML | 4 | Evaluation interfaces for generative models | Apr 04, 2021 | |
Python | 2 | Evaluation tasks for ObjectNav models | Jan 09, 2023 | |
Python | 5 | Evaluation of Finnish generative models | Apr 17, 2023 | |
Python | 7 | Trainer and Evaluation scripts for fine-tuning Whisper models for the Ukrainian language | Apr 20, 2023 | |
Python | 247 | Used for adaptive human in the loop evaluation of language and embedding models. | Apr 22, 2023 | |
Python | 2 | AAAI ICWSM-2022 Evaluation of Fake News Detection with Knowledge-Enhanced Language Models | Dec 30, 2022 | |
Python | 7 | Command-line tool and Python API for targeted syntactic evaluation of language models | Dec 31, 2021 | |
None | 5 | Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge". | Jun 26, 2023 | |
Java | 2 | ELUE (Efficient Language Understanding Evaluation) is a standard benchmark for efficient NLP models. | Nov 03, 2022 | |
Python | 1633 | Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard | Sep 12, 2022 | |
None | 54 | A large-scale complex question answering evaluation of ChatGPT and similar large-language models | Mar 21, 2023 | |
Python | 2 | Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard | Feb 26, 2023 | |
None | 3 | A large-scale complex question answering evaluation of ChatGPT and similar large-language models | May 07, 2023 | |
Python | 38 | LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation | May 22, 2023 | |
Python | 10 | Forecasting models, development, evaluation, and validation | Jan 21, 2022 | |
Java | 2 | Performance Evaluation of Machine Learning Models | Apr 11, 2022 | |
Python | 8 | Evaluation tool for object detection models | Apr 06, 2023 | |
Python | 4 | Evaluation metrics for scvi-tools models | Feb 14, 2023 | |
Python | 2 | Automatic Question Generation & Difficulty Control for the Portuguese Language | May 06, 2023 | |
Java | 60 | Clinical Quality Language Evaluation Engine | Dec 21, 2022 |