A Stochastic Technique to Obtain Training Data for Word Segmentation

Unlike Western languages, Japanese has no explicit word boundaries, which often makes Japanese documents hard to analyze. The difficulty is greater in specialized domains such as medical, mechanical, and computer science documents. In this work, we discuss how to obtain pseudo t...

Bibliographic Details
Published in: Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, Vol. 3, pp. 283-286
Main Authors: Fukuda, Takuya; Miura, Takao
Format: Conference Proceeding
Language: English
Published: Washington, DC, USA: IEEE Computer Society, 15.09.2009
IEEE
Series: ACM Conferences
ISBN: 0769538010, 9780769538013