A multilingual dataset for NER covering 91 languages and 25 scripts
Jonas Golde
whoisjones
AI & ML interests
Data-efficient transfer learning
Recent Activity
updated a dataset 27 days ago
whoisjones/finerweb_document_context
published a dataset 27 days ago
whoisjones/finerweb_document_context
updated a dataset 3 months ago
whoisjones/sudoku
Organizations
Models 12
whoisjones/otter-bi-mmbert
Token Classification • 0.5B • Updated • 405
whoisjones/otter-bi-rembert
Updated • 10
whoisjones/otter-ce-rembert
Updated • 1
whoisjones/otter-ce-mmbert
Updated • 3
whoisjones/finerweb-multilabel-classifier-xlmr-4o
Text Classification • 0.3B • Updated • 1
whoisjones/finerweb-binary-classifier-xlmr-4o
Text Classification • 0.3B • Updated • 4
whoisjones/finerweb-binary-classifier-xlmr-gemma3
Text Classification • 0.3B • Updated • 5
whoisjones/finerweb-multilabel-classifier-xlmr-gemma3
Text Classification • 0.3B • Updated • 3
whoisjones/finerweb-binary-classifier-mdeberta-gemma3
Text Classification • 0.3B • Updated • 1
whoisjones/finerweb-binary-classifier-mdeberta-4o
Text Classification • 0.3B • Updated
Datasets 29
whoisjones/finerweb_document_context
Updated • 127
whoisjones/sudoku
Viewer • Updated • 1.42M • 4
whoisjones/maze
Viewer • Updated • 9k • 8
whoisjones/multinerd
Viewer • Updated • 1.67M • 135
whoisjones/masakhaner
Viewer • Updated • 153k • 11 • 1
whoisjones/uner
Viewer • Updated • 66.8k • 8
whoisjones/fiNERweb
Viewer • Updated • 3.98M • 202 • 9
whoisjones/fiNERweb-x
Updated • 15
whoisjones/fiNERweb-x-multi
Updated • 44
whoisjones/fiNERweb-gemma-x-multi
Updated • 9