打印本文 打印本文  關閉窗口 關閉窗口  
      英國國家語料庫(BNC)介紹
      作者:admin  文章來源:本站原創  點擊數  更新時間:2011-11-16  文章錄入:admin  責任編輯:admin

       

      How the BNC was created

      The BNC project was carried out and is managed by the BNC Consortium, an industrial/academic consortium led by Oxford University Press, of which the other members are major dictionary publishers Addison-Wesley Longman and Larousse Kingfisher Chambers; academic research centres at Oxford University Computing Services (OUCS), the University Centre for Computer Corpus Research on Language (UCREL) at Lancaster University, and the British Library's Research and Innovation Centre. The project was funded by the commercial partners, the Science and Engineering Council (now EPSRC) and the UK government's Department of Trade and Industry under the Joint Framework for Information Technology (JFIT) programme. Additional support was provided by the British Library and the British Academy.

      Creation process in brief

      The creation of the corpus started with a careful planning stage where the design principles were drawn up. These principles included the selection criteria that were used as the basis for the collection of the texts (a separate section describes the selection criteria for the written and the spoken parts of the corpus).

      Once a suitable texts was identified and permission to use it had been obtained, the text was converted to machine readable form. The conversion was performed by one of the commercial partners (OUP, Longman or Chambers). The resulting text was then converted to the standard project encoding format at OUCS, where its accuracy and internal consistency was also validated. The text was then passed to UCREL, where word class tagging was automatically added, and returned to OUCS for documentation and accession into the corpus. Each stage of corpus processing was recorded in a database maintained at OUCS.

      Work on building the corpus commenced in 1991 and was completed in 1994. The first general release of the corpus for European researchers was announced in February 1995. After the completion of the first edition of the BNC, a phase of tagging improvement was undertaken at Lancaster University with funding from the Engineering and Physical Sciences Research Council (Research Grant No. GR/F 99847). This tagging enhancement project was led by Geoffrey Leech, Roger Garside and Tony McEnery. Correction and validation of the bibliographic and contextual information in all the BNC Headers was also carried out for this second version of the corpus, known as the BNC World Edition. BNC World was made available for world-wide distribution in 2001. In response to user feedback, the original SGML version of the corpus was later converted into XML. Additional mark-up for lemma and simplified word-class annotation was added and the treatment of multi-word units was improved. Minor errors and inconsistencies were also corrected. BNC XML Edition was released in 2007. Two sub-sets from the corpus have been published separately: the BNC Sampler and the BNCBaby.

      web address:

      http://www.scottishcorpus.ac.uk/cmsw/ 

      more corpus addresses:

      /Article/201111/2702.html

       

          ■點擊→英語疑難問題·綜合解答

       

      上一頁  [1] [2] 

      打印本文 打印本文  關閉窗口 關閉窗口  
      主站蜘蛛池模板: 国产精品久久久久久久| 收集最新中文国产中文字幕| 偷自拍亚洲视频在线观看99| 麻花传MD034苏蜜清歌| 在线播放精品一区二区啪视频| 久久久久久久久人体| 欧美在线视频一区| 偷偷做久久久久网站| 色狠狠色狠狠综合一区| 国产福利高颜值在线观看| jizzjizzjizz中国| 扒开两腿猛进入爽爽视频| 九九影院理论片在线观看一级| 波多野结衣456| 动漫美女被吸乳羞羞网站动漫| 高分少女免费观看第一季| 国产综合久久久久鬼色| 一区二区三区在线看| 日本免费精品一区二区三区| 亚洲人成伊人成综合网久久久 | 夜色私人影院永久入口| 丰满少妇AAAAAA爰片毛片| 李宗瑞60集k8经典网| 亚洲欧美日韩国产综合五月天| 精品亚洲一区二区三区在线播放| 国产仑乱无码内谢| 色人阁在线视频| 国产精品视频全国免费观看 | 污污视频免费观看网站| 午夜影视免费完整高清在线观看网站| 韩日午夜在线资源一区二区| 国产精品久久久小说| 99久久国语露脸精品国产| 影音先锋在线_让看片永远陪伴| 久久国产中文字幕| 最近的免费中文字幕视频| 亚洲熟妇少妇任你躁在线观看| 秋葵视频在线观看在线下载| 四虎国产精品免费久久久| 青青草成人影院| 国产无遮挡又黄又爽在线观看 |