|
@_stefan_munich | |||||
|
Turkish-#NLP: Anyone interested in a Turkish BERT and wants to evaluate it on downstream tasks?
I did evaluation only for UD PoS tagging - any help is really appreciated!
Would really like to have a proper evaluation before adding it to the @huggingface Transformers hub🤗
|
||||||
|
||||||
|
Stefan
@_stefan_munich
|
2. velj |
|
Thanks for your positive reactions 🤗
I'll try to answer all questions now 😅
The repo for Turkish BERT model can be found here:
github.com/stefan-it/turk…
|
||
|
|
||
|
Bram Vanroy
@BramVanroy
|
1. velj |
|
Great work, as usual! Can you share the results of the PoS tagging? I'm very curious about these highly morphological languages and how LM perform on them.
|
||
|
|
||
|
Stefan
@_stefan_munich
|
2. velj |
|
Repo is here, I've already added the PoS tagging results 😄
github.com/stefan-it/turk…
|
||
|
|
||
|
ibrahim e. demirci
@iedmrc
|
1. velj |
|
What is your training corpus?
|
||
|
|
||
|
Stefan
@_stefan_munich
|
2. velj |
|
Currently I'm using the Turkish part of the OSCAR corpus:
traces1.inria.fr/oscar/
|
||
|
|
||
|
Gözde Gül Sahin
@GzdeGlSahin1
|
1. velj |
|
Stefan, we had created small linguistic tests (probing tasks) exactly for this purpose. We also have many for Turkish, and a web service for Transformer models: …pector.ukp.informatik.tu-darmstadt.de
|
||
|
|
||
|
Stefan
@_stefan_munich
|
2. velj |
|
This looks pretty awesome 😍
I just uploaded an archive (with pytorch_model.bin, vocab.txt and config.json) but the interface still shows "uploading" - I'll wait a bit now. Should I try the GitHub repo 🤔
|
||
|
|
||
|
Gokhan Tur
@tur_gokhan
|
1. velj |
|
|
||
|
Kemal Oflazer
@oflazer
|
1. velj |
|
Stefan I have asked some colleagues to contact you for this. If several billions of words of additional Turkish text would improve your model, let me know and I can dig up what I have.
|
||
|
|
||
|
Zehra
@zehayirci
|
1. velj |
|
BERTAN 👍
|
||
|
|
||
|
Evin Pinar Ornek
@evinpinar
|
1. velj |
|
ahah it is a weird name that I checked if it really existed. there are not any normal names in Turkish that has "bert" in it. But I would suggest using Turkish translation of the name "Bert" from the Sesame Street: "Büdü" ☺️ Elmo is "Edi".
|
||
|
|
||