热烈欢迎来自全国各地的老师们!语料库语言学常用术语Corpus(语料库,尸体):():acollectionoftext,nowusuallyinmachine-piledtoberepresentativeofaparticularkindoflanguageandoftenprovidedwithsomekindofannotation(标注).按照一定的采样标准采集而来的、能代表一种语言或者某语言的一种变体或文类的电子文本集。OCACLEC桂诗春、杨惠中CorpusLinguistics语料库语言学立足于大量真实的语言数据,主要通过概率统计方法,对语料库做系统而穷尽的观察和概括得出结论。从本质上来讲,是实证性的(empirical).为何要建立语料库?为何要用语料库方法研究语言并将其运用于语言学****Example:Startorbegin?在口语中哪个更常用?我们的老师经常说Let’sbegin!之类的话,对吗?但有人在BNC等语料库中查到,在口语中,start更常用。语料库的方法基于真实的语言使用情况,事实胜于雄辩Acorpuscanbeanalyzedusingsoftwaretools,,,itispossibletoseehowlanguageisreallyused,andtofindanswerstoquestionslikethese:WhatarethemostfrequentwordsandphrasesinEnglish?Whichtensesdopeopleusemostoften?Whatprepositionsfollowparticularverbs?Howdopeopleusewordslikecan,may,andmight?Howmanywordsmustalearnerknowinordertoparticipateineverydayconversation?:acorpusenhancedwithvarioustypesoflinguisticinformation(ortaggedcorpus).Anannotatedcorpusmaybeconsideredtobearepositoryoflinguisticinformation,becausetheinformationwhichwasimplicitintheplaintexthasbeenmadeexplicitthroughconcreteannotation(“addedvalue附加值”).Comparable(reference参照)corpus:parisonofdifferent(typesof),::acorpuswhichrepresentssmallcollectionsofindividualmonolingualcorpora(orsubcorpora)(aligned)corpus:amultilingualcorpuswheretextsinonelanguageandtheirtranslationsintootherlanguagesarealigned,sentencebysentence,:Atypeofcorporathatareassembledforaspecificpurpose,(exceptwithinthescopeoftheirgivenpurpose)and,ifusedforother
语料库常用术语解释 来自淘豆网www.taodocs.com转载请标明出处.