eplm

Evaluation of Portuguese Language Models

Stars

12

Forks

0

Language

None

Last Updated

May 04, 2023

Similar Repos

Repo	Language	Stars	Description	Updated At
lm-evaluation	Python	68	Evaluation suite for large-scale language models.	Jul 07, 2022
oasst-model-eval	Python	20	Evaluation of the Open-Assistant language models	Apr 28, 2023
Google-Sanskrit-translate-evaluation	Jupyter Notebook	2	Google Sanskrit translation evaluation using language models	Oct 13, 2023
mmlu_easylm	Python	2	MMLU evaluation for language models served with EasyLM	Apr 12, 2023
lang-pt_PT	None	2	Portuguese Language Unraid Repo	Apr 28, 2023
portuguese-bert	Python	600	Portuguese pre-trained BERT models	Sep 10, 2022
ScandEval	Python	18	Evaluation of language models on mono- or multilingual Scandinavian language tasks.	Apr 20, 2023
lm-evaluation-harness	Python	4	A framework for few-shot evaluation of autoregressive language models.	Nov 21, 2021
lm-evaluation-harness	Python	13	A framework for few-shot evaluation of autoregressive language models.	Jul 12, 2022
lm-evaluation-harness	Python	315	A framework for few-shot evaluation of autoregressive language models.	Aug 10, 2022
lm-evaluation-harness	Python	10	A framework for few-shot evaluation of autoregressive language models.	Jan 28, 2023
lm-evaluation-harness	None	8	A framework for few-shot evaluation of autoregressive language models.	Apr 18, 2023
lm-evaluation-harness	Python	2	A framework for few-shot evaluation of autoregressive language models.	Mar 07, 2023
lm-evaluation-harness	Python	2	A framework for few-shot evaluation of autoregressive language models.	Jun 12, 2023
lm-evaluation-harness	Python	5	A framework for few-shot evaluation of autoregressive language models.	Jun 16, 2023
AttrScore	None	8	Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"	May 24, 2023
ptstem	R	15	Stemming Algorithms for the Portuguese Language	Apr 21, 2022
nlcc-data	Jupyter Notebook	2	Benchmark dataset and evaluation for large language models that generate code	Apr 29, 2022
bigcode-evaluation-harness	Python	47	A framework for the evaluation of autoregressive code generation language models.	Dec 01, 2022
helm	Python	42	Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models …	Nov 20, 2022
Regression-Analysis-on-Avocado-prices	Jupyter Notebook	2	Evaluation of regression models	Aug 17, 2021
AlexandraAI-eval	Python	4	Evaluation of finetuned models.	Feb 22, 2023
ilm-eval	Python	3	Some code for evaluation and a leaderboard of instruction following language models	Jun 19, 2023
portuguese_jokes	None	9	Random sarcasm and jokes in the Portuguese language	Mar 18, 2022
rslp	R	15	A Stemming Algorithm for the Portuguese Language	Jul 21, 2022
BioBERTpt	Jupyter Notebook	26	Biomedical and Clinical BERT for Portuguese Language	Jul 25, 2022
olid-br	Jupyter Notebook	2	Offensive Language Identification Dataset for Brazilian Portuguese.	Oct 18, 2023
elmo	Jupyter Notebook	11	Supporting code for the paper "Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity …	Nov 11, 2022
Speech-Rater	None	5	ML and deep learning algorithms/models to assess spoken English language proficiency; it transforms …	Jul 22, 2022
gcmeval	R	4	evaluation of global climate models	Jun 29, 2022
evaluation-interfaces	HTML	4	Evaluation interfaces for generative models	Apr 04, 2021
object-nav-eval	Python	2	Evaluation tasks for ObjectNav models	Jan 09, 2023
finnish-generative-model-eval	Python	5	Evaluation of Finnish generative models	Apr 17, 2023
whisper-ukrainian	Python	7	Trainer and Evaluation scripts for fine-tuning Whisper models for the Ukrainian language	Apr 20, 2023
cheese	Python	247	Used for adaptive human in the loop evaluation of language and embedding models.	Apr 22, 2023
fake-news-detection	Python	2	AAAI ICWSM-2022 Evaluation of Fake News Detection with Knowledge-Enhanced Language Models	Dec 30, 2022
syntaxgym-core	Python	7	Command-line tool and Python API for targeted syntactic evaluation of language models	Dec 31, 2021
FactKB	None	5	Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge".	Jun 26, 2023
Efficient-Language-Understanding-Evaluation	Java	2	ELUE (Efficient Language Understanding Evaluation) is a standard benchmark for efficient NLP models.	Nov 03, 2022
ChineseGLUE	Python	1633	Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus and leaderboard	Sep 12, 2022
Complex-Question-Answering-Evaluation-of-ChatGPT	None	54	A large-scale complex question answering evaluation of ChatGPT and similar large-language models	Mar 21, 2023
chineseGLUE	Python	2	Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus and leaderboard	Feb 26, 2023
Complex-Question-Answering-Evaluation-of-ChatGPT	None	3	A large-scale complex question answering evaluation of ChatGPT and similar large-language models	May 07, 2023
LLMScore	Python	38	LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation	May 22, 2023
forecasting	Python	10	Forecasting models, development, evaluation, and validation	Jan 21, 2022
model-evaluation-workbench	Java	2	Performance Evaluation of Machine Learning Models	Apr 11, 2022
Object-Detection-Evaluation	Python	8	Evaluation tool for object detection models	Apr 06, 2023
scvi-criticism	Python	4	Evaluation metrics for scvi-tools models	Feb 14, 2023
question-generation-portuguese	Python	2	Automatic Question Generation & Difficulty Control for the Portuguese Language	May 06, 2023
cql-engine	Java	60	Clinical Quality Language Evaluation Engine	Dec 21, 2022