Skip to content

benchmark: Add MTEB-PT benchmark#4801

Open
Lucas-Okamura wants to merge 3 commits into
embeddings-benchmark:mainfrom
Lucas-Okamura:dataset/mteb-pt
Open

benchmark: Add MTEB-PT benchmark#4801
Lucas-Okamura wants to merge 3 commits into
embeddings-benchmark:mainfrom
Lucas-Okamura:dataset/mteb-pt

Conversation

@Lucas-Okamura

@Lucas-Okamura Lucas-Okamura commented Jun 12, 2026

Copy link
Copy Markdown

Summary

This PR adds MTEB-PT (Portuguese Benchmark) to MTEB as a benchmark with 3 STS, 5 Classification, 3 Reranking and 3 Retrieval Tasks.

Validation

Completed locally:

  • make test

Focused pytest result: 6119 passed, 191 skipped.

Notes

This PR adds only the benchmark with existing MTEB datasets with portuguese subsets. The paper was recently accepted, and I will soon open another PR to add it to the references.

@Samoed Samoed added the new benchmark Issues related to adding a new benchmark label Jun 12, 2026
@Samoed Samoed requested a review from KennethEnevoldsen June 12, 2026 10:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

new benchmark Issues related to adding a new benchmark

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants