Japanese model #450

vochicong · 2017-06-02T09:25:37Z

Hi, I am interested to using CoreNLP (and DeepDive) with Japanese.
Are you working on Japanese?
And how can I start building a Japanese model?
How did you build Chinese or English models?

Thanks!

manning · 2017-06-10T23:17:35Z

No, we are not currently working on Japanese. The first requirement to make models for any language is labeled data to train models from. Most of the components in CoreNLP use supervised learning. Traditionally, the public availability of Japanese language corpora hasn't been very good, but, now, e.g., the Japanese Universal Dependencies corpora could be used to train several components (segmenter, POS, depparse). However, I still don't know of any usable Japanese NER data. The other requirement is somebody willing to do the work. In general, our expansions to other languages have occurred because somebody was interested in having the language available for some reason.

vochicong · 2017-06-19T10:36:50Z

@manning Thank you for your reply. Actually, I am an NLP newbie and still don't fully understand your valuable answer ;)
But I think that Kuromoji, a Japanese morphological analyzer implemented in Java, looks promising to be embedded in CoreNLP.

I will search for Japanese NER data and if I find one I will share it with you.

BTW, I enjoy your Natural Language Processing with Deep Learning course very much! Thanks for it too!

vochicong · 2017-08-28T05:10:30Z

I found jigg. It's said having similar interface to CoreNLP, actually including CoreNLP and Kuromoji, a Japanese tokenizer. The authors are inspired by CoreNLP, once tried to make an Japanese extension to CoreNLP, but later decided to make jigg for more flexibility.

For Japanese NER, they use JUMAN/KNP.

AngledLuffa · 2019-11-29T08:15:33Z

FWIW (mostly for archival reasons at this point) there are now Japanese models for stanfordnlp

https://stanfordnlp.github.io/stanfordnlp/models.html#human-languages-supported-by-stanfordnlp

vochicong · 2019-12-02T06:17:46Z

Thank you @AngledLuffa for your update.

devanghingu · 2023-02-19T21:20:39Z

@AngledLuffa @vochicong i just checked standford package and stanza. but still not support for process relation extraction(kbp)

AngledLuffa · 2023-02-19T21:44:33Z

There is no relation extraction model in Stanza

manning added enhancement help wanted multilingual labels Jun 10, 2017

polm mentioned this issue May 17, 2019

Japanese Model explosion/spaCy#3756

Closed

AngledLuffa closed this as completed Nov 29, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Japanese model #450

Japanese model #450

vochicong commented Jun 2, 2017

manning commented Jun 10, 2017

Uh oh!

vochicong commented Jun 19, 2017

Uh oh!

vochicong commented Aug 28, 2017 •

edited

Loading

Uh oh!

AngledLuffa commented Nov 29, 2019

Uh oh!

vochicong commented Dec 2, 2019

Uh oh!

devanghingu commented Feb 19, 2023

Uh oh!

AngledLuffa commented Feb 19, 2023

Uh oh!

Japanese model #450

Japanese model #450

Comments

vochicong commented Jun 2, 2017

manning commented Jun 10, 2017

Uh oh!

vochicong commented Jun 19, 2017

Uh oh!

vochicong commented Aug 28, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AngledLuffa commented Nov 29, 2019

Uh oh!

vochicong commented Dec 2, 2019

Uh oh!

devanghingu commented Feb 19, 2023

Uh oh!

AngledLuffa commented Feb 19, 2023

Uh oh!

vochicong commented Aug 28, 2017 •

edited

Loading