Kazuhide Yamamoto


2018

pdf bib
Lexical Substitution is Practical for Rare Word Simplification
Takumi Maruyama | Kazuhide Yamamoto
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation

pdf bib
Dataset Construction Method for Word Reading Disambiguation
Koki Nishiyama | Kazuhide Yamamoto | Hideharu Nakajima
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation

pdf bib
VietSentiLex: a sentiment dictionary that considers the polarity of ambiguous sentiment words
Huynh Quoc Viet Vo | Kazuhide Yamamoto
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation

pdf bib
Crowdsourced Corpus of Sentence Simplification with Core Vocabulary
Akihiro Katsuta | Kazuhide Yamamoto
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

pdf bib
Simplified Corpus with Core Vocabulary
Takumi Maruyama | Kazuhide Yamamoto
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

2017

pdf bib
Controlling Target Features in Neural Machine Translation via Prefix Constraints
Shunsuke Takeno | Masaaki Nagata | Kazuhide Yamamoto
Proceedings of the 4th Workshop on Asian Translation (WAT2017)

We propose prefix constraints, a novel method to enforce constraints on target sentences in neural machine translation. It places a sequence of special tokens at the beginning of target sentence (target prefix), while side constraints places a special token at the end of source sentence (source suffix). Prefix constraints can be predicted from source sentence jointly with target sentence, while side constraints (Sennrich et al., 2016) must be provided by the user or predicted by some other methods. In both methods, special tokens are designed to encode arbitrary features on target-side or metatextual information. We show that prefix constraints are more flexible than side constraints and can be used to control the behavior of neural machine translation, in terms of output length, bidirectional decoding, domain adaptation, and unaligned target word generation.

2016

pdf bib
Integrating empty category detection into preordering Machine Translation
Shunsuke Takeno | Masaaki Nagata | Kazuhide Yamamoto
Proceedings of the 3rd Workshop on Asian Translation (WAT2016)

We propose a method for integrating Japanese empty category detection into the preordering process of Japanese-to-English statistical machine translation. First, we apply machine-learning-based empty category detection to estimate the position and the type of empty categories in the constituent tree of the source sentence. Then, we apply discriminative preordering to the augmented constituent tree in which empty categories are treated as if they are normal lexical symbols. We find that it is effective to filter empty categories based on the confidence of estimation. Our experiments show that, for the IWSLT dataset consisting of short travel conversations, the insertion of empty categories alone improves the BLEU score from 33.2 to 34.3 and the RIBES score from 76.3 to 78.7, which imply that reordering has improved For the KFTT dataset consisting of Wikipedia sentences, the proposed preordering method considering empty categories improves the BLEU score from 19.9 to 20.2 and the RIBES score from 66.2 to 66.3, which shows both translation and reordering have improved slightly.

2015

pdf bib
Evaluation Dataset and System for Japanese Lexical Simplification
Tomoyuki Kajiwara | Kazuhide Yamamoto
Proceedings of the ACL-IJCNLP 2015 Student Research Workshop

pdf bib
Empty Category Detection using Path Features and Distributed Case Frames
Shunsuke Takeno | Masaaki Nagata | Kazuhide Yamamoto
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing

2014

pdf bib
Noun Paraphrasing Based on a Variety of Contexts
Tomoyuki Kajiwara | Kazuhide Yamamoto
Proceedings of the 28th Pacific Asia Conference on Language, Information and Computing

2013

pdf bib
Selecting Proper Lexical Paraphrase for Children
Tomoyuki Kajiwara | Hiroshi Matsumoto | Kazuhide Yamamoto
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing (ROCLING 2013)

2011

pdf bib
A Comparison of Unsupervised Bilingual Term Extraction Methods Using Phrase-Tables
Masamichi Ideue | Kazuhide Yamamoto | Masao Utiyama | Eiichiro Sumita
Proceedings of Machine Translation Summit XIII: Papers

2010

pdf bib
Detecting Nasty Comments from BBS Posts
Tatsuya Ishisaka | Kazuhide Yamamoto
Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation

pdf bib
Generation of Summaries that Appropriately and Adequately Express the Contents of Original Documents Using Word-Association Knowledge
Kazuki Takigawa | Masaki Murata | Masaaki Tsuchida | Stijn De Saeger | Kazuhide Yamamoto | Kentaro Torisawa
Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation

pdf bib
Using Goi-Taikei as an Upper Ontology to Build a Large-Scale Japanese Ontology from Wikipedia
Masaaki Nagata | Yumi Shibaki | Kazuhide Yamamoto
Proceedings of the 6th Workshop on Ontologies and Lexical Resources

pdf bib
Textual Entailmaint Recognition using Word Overlap, Mutual Information and Subpath Set
Yuki Muramatsu | Kunihiro Uduka | Kazuhide Yamamoto
Proceedings of the 2nd Workshop on Cognitive Aspects of the Lexicon

pdf bib
Constructing Large-Scale Person Ontology from Wikipedia
Yumi Shibaki | Masaaki Nagata | Kazuhide Yamamoto
Proceedings of the 2nd Workshop on The People’s Web Meets NLP: Collaboratively Constructed Semantic Resources

pdf bib
Even Unassociated Features Can Improve Lexical Distributional Similarity
Kazuhide Yamamoto | Takeshi Asakura
Proceedings of the Second Workshop on NLP Challenges in the Information Explosion Era (NLPIX 2010)

2009

pdf bib
Development of a Japanese-English Software Manual Parallel Corpus
Tatsuya Ishisaka | Masao Utiyama | Eiichiro Sumita | Kazuhide Yamamoto
Proceedings of Machine Translation Summit XII: Posters

2008

pdf bib
Generating Story Reviews Using Phrases Expressing Emotion
Hiroshi Ota | Kazuhide Yamamoto
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation

pdf bib
Extracting Troubles from Daily Reports based on Syntactic Pieces
Yoshifumi Kakimoto | Kazuhide Yamamoto
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation

pdf bib
Summarization by Analogy: An Example-based Approach for News Articles
Megumi Makino | Kazuhide Yamamoto
Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-II

2007

pdf bib
Opinion Extraction based on Syntactic Pieces
Suguru Aoki | Kazuhide Yamamoto
Proceedings of the 21st Pacific Asia Conference on Language, Information and Computation

2006

pdf bib
A Clustered Global Phrase Reordering Model for Statistical Machine Translation
Masaaki Nagata | Kuniko Saito | Kazuhide Yamamoto | Kazuteru Ohashi
Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics

pdf bib
Using Phrasal Patterns to Identify Discourse Relations
Manami Saito | Kazuhide Yamamoto | Satoshi Sekine
Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers

2005

pdf bib
A System to Solve Language Tests for Second Grade Students
Manami Saito | Kazuhide Yamamoto | Satoshi Sekine | Hitoshi Isahara
Companion Volume to the Proceedings of Conference including Posters/Demos and tutorial abstracts

pdf bib
Transforming a Sentence End into News Headline Style
Satoshi Ikeda | Kazuhide Yamamoto
Proceedings of the Third International Workshop on Paraphrasing (IWP2005)

pdf bib
NUT-NTT Statistical Machine Translation System for IWSLT 2005
Kazuteru Ohashi | Kazuhide Yamamoto | Kuniko Saito | Masaaki Nagata
Proceedings of the Second International Workshop on Spoken Language Translation

2004

pdf bib
A Comparison of Two Variant Corpora: The Same Content with Different Source
Kyonghee Paik | Kiyonori Ohtake | Kazuhide Yamamoto
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

In order to investigate the effect of source language on translations, we investigate two variants of a Korean translation corpus. The first variant consists of Korean translations of 162,308 Japanese sentences from the ATR BTEC (Basic Expression Text Corpus). The second variant was made by translating the English translations of the Japanese sentences into Korean. We show that the source language text has a large influence on the target text. Even after normalizing orthographic differences, fewer than 8.3\% of the sentences in the two variants were identical. We describe in general which phenomena differ and then discuss how our analysis can be used in natural language processing.

pdf bib
Detecting Transliterated Orthographic Variants via Two Similarity Metrics
Kiyonori Ohtake | Youichi Sekiguchi | Kazuhide Yamamoto
COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics

2003

pdf bib
Applicability Analysis of Corpus-derived Paraphrases toward Example-based Paraphrasing
Kiyonori Ohtake | Kazuhide Yamamoto
Proceedings of the 17th Pacific Asia Conference on Language, Information and Computation

2002

pdf bib
Paraphrasing of Chinese Utterances
Yujie Zhang | Kazuhide Yamamoto
COLING 2002: The 19th International Conference on Computational Linguistics

pdf bib
Machine Translation by Interaction between Paraphraser and Transfer
Kazuhide Yamamoto
COLING 2002: The 19th International Conference on Computational Linguistics

pdf bib
Towards a Thesaurus of Predicates
Satoshi Shirai | Kazuhide Yamamoto | Francis Bond | Hozumi Tanaka
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)

pdf bib
Acquisition of Lexical Paraphrases from Texts
Kazuhide Yamamoto
COLING-02: COMPUTERM 2002: Second International Workshop on Computational Terminology

pdf bib
Corpus-assisted expansion of manual MT knowledge:
Setsuo Yamada | Kenji Imamura | Kazuhide Yamamoto
Proceedings of the 9th Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages: Papers

1999

pdf bib
Corpus-Based Anaphora Resolution Towards Antecedent Preference
Michael Paul | Kazuhide Yamamoto | Eiichiro Sumita
Coreference and Its Applications

pdf bib
Solutions to problems inherent in spoken-language translation: the ATR-MATRIX approach
Eiichiro Sumita | Setsuo Yamada | Kazuhide Yamamoto | Michael Paul | Hideki Kashioka | Kai Ishikawa | Satoshi Shirai
Proceedings of Machine Translation Summit VII

ATR has built a multi-language speech translation system called ATR-MATRIX. It consists of a spoken-language translation subsystem, which is the focus of this paper, together with a highly accurate speech recognition subsystem and a high-definition speech synthesis subsystem. This paper gives a road map of solutions to the problems inherent in spoken-language translation. Spoken-language translation systems need to tackle difficult problems such as ungrammaticality. contextual phenomena, speech recognition errors, and the high-speeds required for real-time use. We have made great strides towards solving these problems in recent years. Our approach mainly uses an example-based translation model called TDMT. We have added the use of extra-linguistic information, a decision tree learning mechanism, and methods dealing with recognition errors.

1998

pdf bib
Splitting Long or Ill-formed Input for Robust Spoken-language Translation
Osamu Furuse | Setsuo Yamada | Kazuhide Yamamoto
COLING 1998 Volume 1: The 17th International Conference on Computational Linguistics

pdf bib
Feasibility Study for Ellipsis Resolution in Dialogues by Machine-Learning Technique
Kazuhide Yamamoto | Eiichiro Sumita
COLING 1998 Volume 2: The 17th International Conference on Computational Linguistics

pdf bib
Splitting Long or Ill-formed Input for Robust Spoken-language Translation
Osamu Furuse | Setsuo Yamada | Kazuhide Yamamoto
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 1

pdf bib
Feasibility Study for Ellipsis Resolution in Dialogues by Machine-Learning Technique
Kazuhide Yamamoto | Eiichiro Sumita
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 2