A Stochastic Technique to Obtain Training Data for Word Segmentation
Unlike western languages, there exists no word boundary in Japanese. This is why we face to hard problems to analyze documents in Japanese very often. More difficulty arises in expertised domains such as medical, mechanical, computer science documents. In this work, we discuss how to obtain pseudo t...
Uloženo v:
| Vydáno v: | Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03 Ročník 3; s. 283 - 286 |
|---|---|
| Hlavní autoři: | , |
| Médium: | Konferenční příspěvek |
| Jazyk: | angličtina |
| Vydáno: |
Washington, DC, USA
IEEE Computer Society
15.09.2009
IEEE |
| Edice: | ACM Conferences |
| Témata: |
Computing methodologies
> Modeling and simulation
> Model development and analysis
> Modeling methodologies
Mathematics of computing
> Probability and statistics
> Probabilistic reasoning algorithms
> Markov-chain Monte Carlo methods
Mathematics of computing
> Probability and statistics
> Probabilistic reasoning algorithms
> Sequential Monte Carlo methods
Mathematics of computing
> Probability and statistics
> Probabilistic representations
> Markov networks
|
| ISBN: | 0769538010, 9780769538013 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
Buďte první, kdo okomentuje tento záznam!

