Technical University of Denmark
8 files

Pretrained sentence BERT models 20 Newsgroups embeddings

dataset
posted on 2023-09-29, 08:13, authored by Beatrix Miranda Ginn Nielsen

Embeddings of the 20 Newsgroups dataset, computed with pretrained Sentence BERT models.

20 Newsgroups dataset: http://qwone.com/~jason/20Newsgroups/

Pretrained models used:
all-distilroberta-v1: https://huggingface.co/sentence-transformers/all-distilroberta-v1
all-MiniLM-L12-v2: https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2
all-mpnet-base-v2: https://huggingface.co/sentence-transformers/all-mpnet-base-v2
multi-qa-distilbert-cos-v1: https://huggingface.co/sentence-transformers/multi-qa-distilbert-cos-v1
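
For readers who want to compute comparable embeddings themselves, the following is a minimal sketch and not necessarily the procedure used to produce the files in this record. It assumes the sentence-transformers and scikit-learn packages are installed, and it assumes scikit-learn's copy of 20 Newsgroups is an acceptable stand-in for the source linked above. The helper names (load_encoder, load_newsgroups, embed_corpus, run_all), the batch size, and the choice of the "train" subset are all illustrative assumptions; any preprocessing applied to the published embeddings is not described here and is not reproduced.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# Model identifiers taken from the list above (Hugging Face Hub names).
MODEL_NAMES: List[str] = [
    "sentence-transformers/all-distilroberta-v1",
    "sentence-transformers/all-MiniLM-L12-v2",
    "sentence-transformers/all-mpnet-base-v2",
    "sentence-transformers/multi-qa-distilbert-cos-v1",
]

# An encoder maps a sequence of texts to one embedding per text.
Encoder = Callable[[Sequence[str]], object]


def load_encoder(model_name: str, batch_size: int = 32) -> Encoder:
    """Wrap a pretrained model as a plain callable (hypothetical helper)."""
    # Imported lazily so the rest of the module works without the package.
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_name)

    def encode(texts: Sequence[str]) -> object:
        # Returns an array of shape (number of texts, embedding dimension).
        return model.encode(
            list(texts), batch_size=batch_size, show_progress_bar=False
        )

    return encode


def load_newsgroups(subset: str = "train") -> Tuple[List[str], List[int]]:
    """Fetch raw posts and integer labels (assumption: scikit-learn's copy)."""
    from sklearn.datasets import fetch_20newsgroups

    bunch = fetch_20newsgroups(subset=subset)
    return list(bunch.data), [int(t) for t in bunch.target]


def embed_corpus(
    texts: Sequence[str], encoders: Dict[str, Encoder]
) -> Dict[str, object]:
    """Apply every encoder to the same texts, keyed by encoder name."""
    return {name: encode(texts) for name, encode in encoders.items()}


def run_all(subset: str = "train") -> Dict[str, object]:
    """Download the data and all four models, then embed (network required)."""
    texts, _labels = load_newsgroups(subset)
    encoders = {name: load_encoder(name) for name in MODEL_NAMES}
    return embed_corpus(texts, encoders)
```

Calling run_all() downloads the dataset and the four models on first use, so it needs network access and some disk space. Passing encoders in as plain callables keeps embed_corpus independent of any particular library, which makes it easy to try on a few strings before running the full corpus.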

Funding

Danish Pioneer Centre for AI, DNRF grant number P1


    DTU Compute
