Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 24

1	『日本語歴史コーパス』統合語彙表（バージョン2022.03）
	国立国語研究所通時コーパスプロジェクト; 小木曽智信; National Institute for Japanese Language and Linguistics, Diachronic Corpus Project. - : 国立国語研究所, 2022
	BASE
	Show details

2	『日本語歴史コーパス』短単位統合語彙表（バージョン2021.03）
	国立国語研究所通時コーパスプロジェクト; 小木曽智信; National Institute for Japanese Language and Linguistics, Diachronic Corpus Project. - : 国立国語研究所, 2021
	BASE
	Show details

3	コーパスで日本語の歴史を探る : 「通時コーパス」プロジェクトの取り組み
	小木曽智信
	In: https://www2.ninjal.ac.jp/openhouse/2019/ (2021)
	BASE
	Show details

4	コーパスを使った日本語の歴史の研究
	小木曽智信
	In: https://www2.ninjal.ac.jp/openhouse/2018/ (2021)
	BASE
	Show details

5	『日本語歴史コーパス』の紹介
	小木曽智信; 松崎安子
	In: https://www2.ninjal.ac.jp/openhouse/2020/ (2021)
	BASE
	Show details

6	多重の読みを持つ宣命コーパスの構築
	呉寧真; 池田幸恵; 須永哲也...
	In: http://id.nii.ac.jp/1001/00208605/ (2020)
	BASE
	Show details

7	近代の歴史的資料を対象とした機械学習による文境界推定
	白井良介; 松村雪桜; 小木曽智信...
	In: http://id.nii.ac.jp/1001/00203040/ (2020)
	BASE
	Show details

8	「人情本コーパス」の設計と構築
	藤本灯; 北﨑勇帆; 市村太郎. - : 国立国語研究所, 2017
	BASE
	Show details

9	多重の読みを持つテキストのコーパス化
	小木曽智信; Toshinobu OGISO
	In: http://pj.ninjal.ac.jp/corpus_center/lrw2016.html (2017)
	BASE
	Show details

10	『日本語歴史コーパス』における原文KWIC表示機能の実装
	小木曽智信; 岡照晃; 中村壮範...
	In: http://pj.ninjal.ac.jp/corpus_center/lrw2017.html (2017)
	BASE
	Show details

11	日本語コーパスの包括的検索環境の実現に向けて
	前川喜久雄; 浅原正幸; 小木曽智信...
	In: http://pj.ninjal.ac.jp/corpus_center/lrw2016.html (2017)
	BASE
	Show details

12	〈著書紹介〉近藤泰弘，田中牧郎，小木曽智信編『コーパスと日本語史研究』
	小木曽智信. - : 国立国語研究所, 2016
	BASE
	Show details

13	『現代日本語書き言葉均衡コーパス』の文境界修正
	小西光; 中村壮範; 田中弥生. - : 国立国語研究所, 2015
	BASE
	Show details

14	パネル・ディスカッション
	清水康行; 小林千草; 田中牧郎. - : 国立国語研究所, 2014
	BASE
	Show details

15	〈共同研究プロジェクト紹介〉萌芽・発掘型 : 統計と機械学習による日本語史研究歴史的日本語資料のアノテーションと自動濁点付与
	小木曽智信; Toshinobu OGISO. - : 国立国語研究所, 2013
	Abstract: 国立国語研究所言語資源研究系 ; 通時コーパスの構築に必要とされる歴史的日本語資料のアノテーションの全体について俯瞰した上で,アノテーション作業の自動化の試みの一つとして濁点の自動付与に関する研究成果を紹介する。歴史的資料では,濁点が十分に付与されていないものが少なくないが,そのままでは読みにくく検索や形態素解析にとって不都合である。そこで統計的機械学習に基づく自動濁点付与の手法を開発し,適合率約96%,再現率約98%での濁点付与を可能にした。これにより通時コーパス構築の作業負担の軽減が期待できる。最後に,今後の歴史コーパスに期待される高度なアノテーションについて展望する。 ; Following a survey of annotations for historical Japanese documents that are required for the construction of a diachronic corpus, I introduce the results of our research on adding dakuten (the voicing diacritic) automatically. Raw historical texts often include characters with dakuten omitted, but such texts degrade readability and retrievability and are not suitable for morphological analysis. We therefore developed an automatic annotation technique for dakuten based on statistical machine learning that has a precision rate of approximately 96% and a recall rate of approximately 98%. This technique can reduce the work involved in diachronic corpus construction. Finally, I discuss the high-level annotation that can be expected in diachronic corpora from now on.
	URL: http://id.nii.ac.jp/1328/00000743/ https://repository.ninjal.ac.jp/?action=repository_action_common_download&item_id=752&item_no=1&attribute_id=54&file_no=1 https://repository.ninjal.ac.jp/?action=repository_uri&item_id=752
	BASE
	Hide details

16	中古和文における個人文体とジャンル文体 : 多変量解析による歴史的資料の文体研究
	小林雄一郎; 小木曽智信; Yuichiro KOBAYASHI. - : 国立国語研究所, 2013
	BASE
	Show details

17	〈受賞紹介〉中古和文を対象とした形態素解析辞書の開発
	小木曽智信. - : 国立国語研究所, 2012
	BASE
	Show details

18	〈全文〉近代語コーパス設計のための文献言語研究成果報告書
	田中牧郎; 岡島昭浩; 小木曽智信. - : 国立国語研究所, 2012
	BASE
	Show details

19	近代語テキストの形態素解析
	小木曽智信; Toshinobu OGISO. - : 国立国語研究所, 2012
	BASE
	Show details

20	『現代日本語書き言葉均衡コーパス』形態論情報データベースの設計と実装改訂版
	小木曽智信; 中村壮範. - : 国立国語研究所, 2011
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern