We built a Japanese web corpus from content obtained from approximately 1.5 billion URLs that we crawled ourselves. In this presentation, we describe how we built the corpus and how we use it within our company.



DAY 1
16:55-17:10 JST
Seminar Room B
JaEnKo
Onsite