<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
<channel>
<title><![CDATA[☆☆ 卢伟个人主页 _____ ☆☆☆☆☆ _____ Lu Wei's Homepage ☆☆]]></title>
<link>http://www.luweixmu.com/home/</link>
<description><![CDATA[对外汉语教学／第二语言教学／(应用)语言学网络资源 Web Resources for TCFL, L2, &amp; (Applied) Linguistics]]></description>
<language>zh-cn</language>
<copyright><![CDATA[Copyright 2005 PBlog3 v2.8]]></copyright>
<webMaster><![CDATA[luwig@xmu.edu.cn(簏巍)]]></webMaster>
<generator>PBlog2 v2.4</generator> 
<image>
	<title>☆☆ 卢伟个人主页 _____ ☆☆☆☆☆ _____ Lu Wei&#39;s Homepage ☆☆</title>
	<url>http://www.luweixmu.com/home/images/logos.gif</url>
	<link>http://www.luweixmu.com/home/</link>
	<description>☆☆ 卢伟个人主页 _____ ☆☆☆☆☆ _____ Lu Wei&#39;s Homepage ☆☆</description>
</image>

			<item>
			<link>http://www.luweixmu.com/home/article.asp?id=167</link>
			<title><![CDATA[New Publications of Linguistic Data Consortium]]></title>
			<author>luwig@xmu.edu.cn(admin)</author>
			<category><![CDATA[语言学]]></category>
			<pubDate>Tue,20 Jul 2010 01:32:29 +0800</pubDate>
			<guid>http://www.luweixmu.com/home/default.asp?id=167</guid>
		<description><![CDATA[In this newsletter:<br/>- 2010 Publication Pipeline Up&#100;ate -<br/>- Up&#100;ated LDC Data Sheets and Papers Pages -<br/><br/>New publications:<br/>- LDC Standard Arabic Morphological Analyzer (SAMA) Version 3.1 - <br/>- NIST 2004 Open Machine Translation (OpenMT) Evaluation -<br/><br/>2010 Publication Pipeline Up&#100;ate<br/><br/>Membership Year (MY) 2010 has included a strong sel&#101;ction of publications including up&#100;ates to the Arabic and Chinese treebanks, Spanish telephone speech and transcript data from the Fisher collection, and Chinese word n-grams collected from the web .&nbsp;&nbsp;Please consult our corpus catalog for a full list of publications distributed by LDC. As we are now in the second half of this membership year, we would like to provide information on what publications you can expect for the remainder of MY2010.&nbsp;&nbsp;Our pipeline includes the following:<br/><br/>Arabic Treebank Part 1 Version 4.1 ~ a revision of Arabic Treebank: Part 1 v 3.0 (POS with full vocalization + syntactic analysis) (LDC2005T02) (ATB1), according to the new Arabic Treebank (ATB) annotation guidelines.&nbsp;&nbsp;The Arabic Treebank project consists of two distinct phases: (a) Part-of-Speech (POS) tagging which divides the text into lexical tokens, and gives relevant information about each token such as lexical category, inflectional features, and a gloss, and (b) Arabic Treebanking which characterizes the constituent structures of word sequences, provides categories for each non-terminal node, and identifies null elements, co-reference, traces, etc. on-terminal node.&nbsp;&nbsp; Arabic Treebank Part 1 Version 4.1 represents the manual revision of the syntactic tree annotation in ATB1, the automatic revision and updating of certain part-of-speech tags, and the manual revision of certain targeted POS tags (function words, in particular).&nbsp;&nbsp;The source data consists of 734 newswire stories from Agence France Presse.<br/><br/>Microsoft Research India POS-Tagged Bengali - to support the task of Part-of-Speech Tagging (POS) and other forms of data-driven linguistic research on Indian languages in general, Microsoft Research India has developed POS labeled data for Hindi, Bengali, and Sanskrit as a part of the Indian Language – Part-of-Speech Tagset (IL-POST) project.&nbsp;&nbsp;The corpora are based on the IL-POST framework. IL-POST is a POS-tagset framework which has been designed to cover the morph-syntactic details of Indian languages. It supports a three-level hierarchy of Categories, Types and Attributes. The Bengali corpus consists of two different levels of information for each lexical token: (a) lexical category and types, and (b) set morphological attributes and their associated values in the context.&nbsp;&nbsp;The data consists of 7168 manually annotated sentences (102933 words) targeted to cover written modern standard Bengali from various sources, including blogs, Multikulti, and Wikipedia. .<br/><br/>TRECVID 2006 Keyframes and Transcripts ~ TREC Video Retrieval Evaluation (TRECVID) is sponsored by NIST to promote progress in content-based retrieval from digital video via open, metrics-based evaluation. The keyframes in this release were extracted for use in the NIST TRECVID 2006 Evaluation.&nbsp;&nbsp;The source data includes approximately 158.6 hours of English, Arabic and Chinese language video data collected by LDC from NBC, CNN, MSN, New Tang Dynasty TV, Phoenix TV, Lebanese Broadcasting Corp.,&nbsp;&nbsp;and China Central TV.&nbsp;&nbsp;The keyframes were sel&#101;cted by going to the middle frame of the shot boundary, then parsing left and right of that frame to locate the nearest I-Frame. This then became the keyframe and was extracted. Keyframes have been provided at both the subshot (NRKF) and master shot (RKF) levels. <br/><br/>Uda Walawe Asian Elephant Vocalizations ~ partially-annotated corpus of Asian Elephant communication/vocalization. The data set contains vocalizations primarily by adult female and juvenile Asian elephants. This corpus is intended to enable researchers in acoustic communication of elephants and other species to compare acoustic features and repertoire diversity to this population. Of particular interest is whether there may be regional dialects that differ among Asian elephant populations in the wild and in captivity. A second interest is in whether structural commonalities exist between this and other species that shed light on underlying social and ecological factors shaping communication systems. <br/><br/>2010 Subscription Members are automatically sent all MY2010 data as it is released.&nbsp;&nbsp;2010 Standard Members are entitled to request 16 corpora for free from MY2010.&nbsp;&nbsp; Non-members may license most data for research use.<br/><br/>Up&#100;ated LDC Data Sheets and Papers Pages<br/><br/>LDC is pleased to announce that both our LDC Data Sheets and LDC Papers pages recently have been up&#100;ated.&nbsp;&nbsp;On our Data Sheets page, you&#39;ll find our growing collection of LDC Data Sheets, each of which highlights a key aspect of the Consortium’s research and development tasks.&nbsp;&nbsp;Recent additions include a data sheet covering Arabic and English treebanking at LDC and one that provides an overview of LDC&#39;s role in sponsored projects.&nbsp;&nbsp; Our up&#100;ated papers page contains several papers from LREC2010:&nbsp;&nbsp;Seventh International Conference on Language Resources and Evaluation, as well as other conferences and journals, dating from 1998 forward.&nbsp;&nbsp;Most papers are available for download in pdf format; presentations slides and posters are available for several papers as well.<br/><br/>On our Papers page, you can read about LDC&#39;s efforts to apply treebank annotation to Arabic broadcast news (Maamouri et al).&nbsp;&nbsp;Broadcast news (BN) transcript data posed new challenges; for instance, the transcript data included metadata which conveys information in addition to the text of what is being said.&nbsp;&nbsp; Some forms of metadata were ignored, such as indications of coughs o&#114; laughter, while others, such as speech effects including discourse markers and word fragments, were annotated.&nbsp;&nbsp;Annotators also had to handle indistinct audio signal wh&#101;rein speech could be heard, but not fully understood, so the words could only be inferred from context rather than from the audio signal.&nbsp;&nbsp;In these cases, the annotation must convey information not contained in the audio signal that accounts for the annotation in that region.&nbsp;&nbsp;The improved Arabic Treebank (ATB) pipeline and revised annotation guidelines proved robust enough to carry out this task with few changes. This paper discusses wh&#101;re some adaptation was necessary and describes the overall pipeline as used in the production of BN ATB data.<br/><br/>Additionally, you can learn about LDC&#39;s role in resource creation for the Knowledge Base Population (KBP) Track of the Text Analysis Conference (TAC) o&#114;ganized by NIST (Simpson et al).&nbsp;&nbsp;The KBP track of TAC is a hybrid descendant of the TREC Question Answering track and the Automated Content Extraction (ACE) evaluation program and is designed to support development of systems that are capable of automatically populating a knowledge base with information about entities mined from unstructured text. An important component of the KBP evaluation is the Entity Linking task, wh&#101;re systems must accurately associate text mentions of unknown Person (PER), o&#114;ganization (ORG), and Geopolitical (GPE) names to entries in a knowledge base. This paper describes the 2009 resource creation efforts, with particular focus on the sel&#101;ction and development of named entity mentions for the Entity Linking task evaluation.<br/><br/>New Publications<br/><br/>(1)&nbsp;&nbsp;The LDC Standard Arabic Morphological Analyzer (SAMA) Version 3.1 was developed by researchers at LDC. SAMA 3.1 is based on, and up&#100;ates Tim Buckwalt&#101;r&#39;s Buckwalt&#101;r Arabic Morphological Analyzer (BAMA) 2.0 (LDC2004L02). Since this is the first public release of SAMA, it has been numbered continuously to reflect the continuity between this release and previous BAMA releases.&nbsp;&nbsp;SAMA 3.1 is a software tool for the morphological analysis of Standard Arabic. SAMA 3.1 considers each Arabic word token in all possible &#39;prefix-stem-suffix&#39; segmentations, and lists all known/possible annotation solutions, with assignment of all diacritic marks, morpheme boundaries (separating clitics and inflectional morphemes from stems), and all Part-of-Speech (POS) labels and glosses for each morpheme segment. The generated output may then be reviewed by users, and the most appropriate annotation sel&#101;cted from among several choices.<br/><br/>The software layer of SAMA 3.1 relies on a data layer that consists primarily of three Arabic-English lexicon files: prefixes (1328 entries), suffixes (945 entries), and stems (79318 entries representing 40654 lemmas). The lexicons are supplemented by three morphological compatibility tables used for controlling prefix-stem combinations (2497 entries), stem-suffix combinations (1632 entries), and prefix-suffix combinations (1180 entries). <br/><br/>The input format, output format, and data layer of SAMA 3.1 were designed to be backward compatible with BAMA. Incremental changes to the data layer in SAMA have resulted in: <br/><br/>increased lexicon coverage in the dictionary files <br/>important changes and additions to the inventory of POS tags <br/>more possible solutions generated for numerous word forms <br/>The software implementation has been up&#100;ated to allow more input/output options, installation and configuration options, and smoother incorporation in other Perl tools/services. The structure of the dictionary and morphotactic tables has remained the same (the tables provided with SAMA 3.1 differ from the BAMA 2.0 tables only in size and content, not in format). Logical separation between the software layer and data layer allows the new software tools to be used with previous versions of the tables (instructions are provided with software documentation).&nbsp;&nbsp;The basic logic that implements the segmentation and analysis look-up for Arabic words is essentially unchanged since BAMA 2.0. <br/><br/>The data layer is now accessed through Berkeley DB, with result-caching enabled by default, leading to improved performance. Various utility scripts have also been added to the software package to facilitate more flexible interaction with tools and data.<br/><br/>LDC Standard Arabic Morphological Analyzer (SAMA) Version 3.1 is distributed via web download. <br/><br/>2010 Subscription Members will automatically receive two copies of this corpus on disc, provided that they have submitted a completed copy of the User License Agreement for LDC Standard Arabic Morphological Analyzer (SAMA) Version 3.1 (LDC2010L01).&nbsp;&nbsp;2010 Standard Members may request a copy as part of their 16 free membership corpora. As a Members-Only release, LDC Standard Arabic Morphological Analyzer (SAMA) Version 3.1 is not available for non-member licensing.<br/><br/>(2)&nbsp;&nbsp;NIST 2004 Open Machine Translation (OpenMT) Evaluation is a package containing source data, reference translations, and scoring software used in the NIST 2004 OpenMT evaluation. It is designed to help evaluate the effectiveness of machine translation systems. The package was compiled and scoring software was developed by researchers at NIST, making use of newswire source data and reference translations collected and developed by LDC.<br/><br/>The objective of the NIST OpenMT evaluation series is to support research in, and help advance the state of the art of, machine translation (MT) technologies -- technologies that translate text between human languages. Input may include all forms of text. The goal is for the output to be an adequate and fluent translation of the o&#114;iginal.&nbsp;&nbsp;The 2004 task was to evaluate translation from Chinese to English and from Arabic to English. Additional information about these evaluations may be found at the NIST Open Machine Translation (OpenMT) Evaluation web site. <br/><br/>This evaluation kit includes a single perl script (mteval-v11a.pl) that may be used to produce a translation quality score for one (or more) MT systems. The script works by comparing the system output translation with a set of (expert) reference translations of the same source text. Comparison is based on finding sequences of words in the reference translations that match word sequences in the system output translation. <br/><br/>This corpus consists of 150 Arabic newswire documents, 150 Chinese newswire documents, and 29 Chinese &#34;prepared speech&#34; documents. For each language, the test set consists of two files: a source and a reference file. Each reference file contains four independent translations of the data set. The evaluation year, source language, test set, version of the data, and source vs. reference file are reflected in the file name. <br/><br/>NIST 2004 Open Machine Translation (OpenMT) Evaluation is distributed via web download. <br/><br/>2010 Subscription Members will automatically receive two copies of this corpus on disc.&nbsp;&nbsp;2010 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for US$150.<br/><br/>Ilya Ahtaridis<br/>Membership Coordinator<br/><br/>Linguistic Data Consortium&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Phone: (215) 573-1275<br/>University of Pennsylvania&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Fax: (215) 573-2175<br/>3600 Market St., Suite 810&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; ldc@ldc.upenn.edu<br/>Philadelphia, PA 19104 USA&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href="http://www.ldc.upenn.edu" target="_blank" rel="external">http://www.ldc.upenn.edu</a><br/><br/>]]></description>
		</item>
		
			<item>
			<link>http://www.luweixmu.com/home/article.asp?id=166</link>
			<title><![CDATA[New Publications of Linguistic Data Consortium]]></title>
			<author>luwig@xmu.edu.cn(admin)</author>
			<category><![CDATA[语言学]]></category>
			<pubDate>Sun,16 May 2010 21:49:30 +0800</pubDate>
			<guid>http://www.luweixmu.com/home/default.asp?id=166</guid>
		<description><![CDATA[In this newsletter:- Coming Soon: LDC Data Scholarship Program! -<br/>New publications:<br/>LDC2010S03- 2003 NIST Speaker Recognition Evaluation -<br/>LDC2010T09- ACE 2005 Mandarin SpatialML Annotations -<br/>LDC2010T10- NIST 2002 Open Machine Translation (OpenMT) Evaluation -<br/>--------------------------------------------------------------------------------<br/>Coming Soon: LDC Data Scholarship Program!<br/>We are pleased to announce that the LDC Data Scholarship program is in the works! This program will provide university students with access to LDC data at no-cost. Each year LDC distributes thousands of dollars worth of data at no- o&#114; reduced-cost to students who demonstrate a need for data, yet cannot secure funding.&nbsp;&nbsp;LDC will formalize this practice through the newly cr&#101;ated LDC Data Scholarship program. <br/>Data scholarships will be offered each semester beginning with the fall 2010 semester (September - December 2010). Students will need to complete an application, which will include a data use proposal and letter of support from their faculty adviser.&nbsp;&nbsp;We anticipate that the sel&#101;ction process will be highly competitive.<br/>Stay tuned for further announcements in our newsletter and on our home page!<br/><br/>New Publications<br/>(1) 2003 NIST Speaker Recognition Evaluation was developed by researchers at NIST (National Institute of Standards and Technology). It consists of just over 120 hours of English conversational telephone speech used as training data and test data in the 2003 Speaker Recognition Evaluation (SRE), along with evaluation metadata and test set answer keys.<br/>2003 NIST Speaker Recognition Evaluation is part of an ongoing series of yearly evaluations conducted by NIST. These evaluations provide an important contribution to the direction of research efforts and the calibration of technical capabilities. They are intended to be of interest to all researchers working on the general problem of text independent speaker recognition. To this end the evaluation was designed to be simple, to focus on core technology issues, to be fully supported, and to be accessible to those wishing to participate. <br/>This speaker recognition evaluation focused on the task of 1-speaker and 2-speaker detection, in the context of conversational telephone speech.&nbsp;&nbsp;The o&#114;iginal evaluation consisted of three parts: 1-speaker detection &#34;limited data&#34;, 2-speaker detection &#34;limited data&#34;, and 1-speaker detection &#34;extended data&#34;. This corpus contains training and test data and supporting metadata (including answer keys) for only the 1-speaker &#34;limited data&#34; and 2-speaker &#34;limited data&#34; components of the o&#114;iginal evaluation. The 1-speaker &#34;extended data&#34; component of the o&#114;iginal evaluation (not included in this corpus) provided metadata only, to be used in conjunction with data from Switchboard-2 Phase II (LDC99S79) and Switchboard-2 Phase III Audio (LDC2002S06). The metadata (resources and answer keys) for the 1-speaker &#34;extended data&#34; component of the o&#114;iginal 2003 SRE evaluation are available from the NIST Speech Group website for the 2003 Speaker Recognition Evaluation. <br/>The data in this corpus is a 120-hour subset of data first made available to the public as Switchboard Cellular Part 2 Audio (LDC2004S07), reorganized specifically for use in the 2003 NIST SRE.<br/>2003 NIST Speaker Recognition Evaluation is distributed on one DVD.<br/>2010 Subscription Members will automatically receive two copies of this corpus.&nbsp;&nbsp;2010 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for US$1000.<br/><br/>(2)&nbsp;&nbsp;ACE 2005 Mandarin SpatialML Annotations was developed by researchers at The MITRE Corporation (MITRE). ACE 2005 Mandarin SpatialML Annotations applies SpatialML tags to a subset of the source Mandarin training data in ACE 2005 Multilingual Training Corpus (LDC2006T06). <br/>SpatialML is a mark-up language for representing spatial e&#173;xpressions in natural language documents. SpatialML focuses is on geography and culturally-relevant landmarks, rather than biology, cosmology, geology, o&#114; other regions of the spatial language domain. The goal is to allow for better integration of text collections with resources such as databases that provide spatial information about a domain, including gazetteers, physical feature databases and mapping services. <br/>The SpatialML annotation scheme is intended to emulate earlier progress on time e&#173;xpressions such as TIMEX2, TimeML, and the 2005 ACE guidelines. The main SpatialML tag is the PLACE tag which encodes information about location. The central goal of SpatialML is to map location information in text to data from gazetteers and other databases to the extent possible by defining attributes in the PLACE tag. Therefore, semantic attributes such as country abbreviations, country subdivision and dependent area abbreviations (e.g., US states), and geo-coordinates are used to help establish such a mapping. The SpatialML guidelines are compatible with existing guidelines for spatial annotation and existing corpora within the ACE research program. <br/>This corpus consists of a 298-document subset of broadcast material from the ACE 2005 Multilingual Training Corpus (LDC2006T06) that has been tagged by a native Mandarin speaker according to version 2.3 of the SpatialML annotation guidelines, which are included in the documentation for this release. <br/>ACE 2005 Mandarin SpatialML Annotations&nbsp;&nbsp;is distributed via web download.<br/>2010 Subscription Members will automatically receive two copies of this corpus on disc.&nbsp;&nbsp;2010 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for US$500.<br/><br/>(3)&nbsp;&nbsp;NIST 2002 Open Machine Translation (OpenMT) Evaluation is a package containing source data, reference translations, and scoring software used in the NIST 2002 OpenMT evaluation. It is designed to help evaluate the effectiveness of machine translation systems. The package was compiled and scoring software was developed by researchers at NIST, making use of newswire source data and reference translations collected and developed by LDC. <br/>The objective of the NIST OpenMT evaluation series is to support research in, and help advance the state of the art of, machine translation (MT) technologies -- technologies that translate text between human languages. Input may include all forms of text. The goal is for the output to be an adequate and fluent translation of the o&#114;iginal. Additional information about these evaluations may be found at the NIST Open Machine Translation (OpenMT) Evaluation web site. <br/>This evaluation kit includes a single perl script that may be used to produce a translation quality score for one (or more) MT systems. The script works by comparing the system output translation with a set of (expert) reference translations of the same source text. Comparison is based on finding sequences of words in the reference translations that match word sequences in the system output translation.<br/>The Chinese-language source text included in this corpus is a reorganization of data that was initially released to the public as Multiple-Translation Chinese (MTC) Part 2 (LDC2003T17). The Chinese-language reference translations are a reorganized subset of data from the same MTC corpus. The Arabic-language data (source text and reference translations) is a reorganized subset of data that was initially released to the public as Multiple-Translation Arabic (MTA) Part 1 (LDC2003T18). All source data for this corpus is newswire text. <br/>For each language, the test set consists of two files, a source and a reference file. Each reference file contains four independent translations of the data set. The evaluation year, source language, test set, version of the data, and source vs. reference file are reflected in the file name.<br/>NIST 2002 Open Machine Translation (OpenMT) Evaluation is distributed via web download.&nbsp;&nbsp;<br/>2010 Subscription Members will automatically receive two copies of this corpus on disc.&nbsp;&nbsp;2010 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for US$150.<br/>--------------------------------------------------------------------------------<br/>Ilya Ahtaridis<br/>Membership Coordinator<br/>--------------------------------------------------------------------<br/>Linguistic Data Consortium&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Phone: (215) 573-1275<br/>University of Pennsylvania&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Fax: (215) 573-2175<br/>3600 Market St., Suite 810&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; ldc@ldc.upenn.edu<br/>Philadelphia, PA 19104 USA&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href="http://www.ldc.upenn.edu" target="_blank" rel="external">http://www.ldc.upenn.edu</a>]]></description>
		</item>
		
			<item>
			<link>http://www.luweixmu.com/home/article.asp?id=165</link>
			<title><![CDATA[新书推荐：多媒体智能教学系统研究与设计]]></title>
			<author>luwig@xmu.edu.cn(admin)</author>
			<category><![CDATA[对外汉语]]></category>
			<pubDate>Sat,03 Apr 2010 21:53:09 +0800</pubDate>
			<guid>http://www.luweixmu.com/home/default.asp?id=165</guid>
		<description><![CDATA[媒体智能教学系统研究与设计<br/>作者：周晓军<br/>出版社：北京语言大学出版社<br/>出版日期：2009-10<br/>ISBN：978-7-5619-2468-6<br/>页数：218<br/>开本：16开<br/>版次：一版一次<br/>装祯方式：平装<br/>内容提要：<br/>&nbsp;&nbsp;&nbsp;&nbsp;本书提供了基于SC方法的复杂知识划一表示方法和细粒度知识模型构建方法，在简化知识处理难度、加速知识整理速度的同时，能守知识点结构化层次属性标注，促进了ITS系统复杂知识之间广泛语义联系的实现；创造性地提出了面向ITS的知识树增长模型，通过知识树动态构建、知识点结构和语义的映射过程，实现学生模型和教师模型动态一体化构建，并与动态教学规划和知识学习过程完全集成；系统采用面向对象的设计方法以及运用软件代理的策略，使得系统结构清晰，智能良好，系统的智能化功能添加或改进更加方便；通过层次知识索引实现知识存储与知识调用无关，在保证知识有机联系的同时，使得知识可以动态智能化添加和系统维护，保证对学生错误的正确定位和实时诊断，便利知识智能化查询；通过基于SC文法规则机制的智能推理方法，实现了启发式的知识搜索和调用，开展教学过程的智能化控制和实现；开放的软件结构和设计方法，便利了系统的扩展；在系统的学习规划以及系统体系化开发上，运用了机器学习机制和概率统计分析方法；通过知识内容与知识学习无关策略保证了系统领域无关的特性，使得系统可以作为一个通用的开发环境，进行广泛的多媒体智能教学系统的进一步研究和系列化开发。在本书的这些研究工作中，系统性地融合了遗传算法的基本方法和进化计算的设计思想，通过知识点学习循环和知识树映射，促进了系统的自适应持续优化，保证了系统的个别化最优化智能教学过程。<br/>作者简介：<br/>　　周晓军，男，籍贯陕西，工学博士，目前在北京语言大学网络教育学院工作，硕士研究生导师。研究方向为智能教学系统、现代远程教育、电子商务等，参加多个国家级科研项目，先后发表学术论文30余篇，参编多部学术专著。<br/>目录：<br/>第一章　引论<br/>1.1　ITS与相关学科研究<br/>1.2　发展现状与存在的问题<br/>1.3　研究内容与选题理由<br/>1.4　研究思路与研究特点<br/>第二章　研究综述<br/>　2.1 ITS研究背景<br/>2.1.1 CAl <br/>2.1.2 ITS<br/>2.1.3 智能教育网站与远程多媒体教学系统<br/>　2.2　ITS的研究内容<br/>2.2.1 概念理解<br/>2.2.2 结构组成<br/>2.2.3 ITS研究的主要问题<br/>　2.3　国内外ITS研究状况<br/>2.3.1 国内外研究工作总结<br/>2.3.2　典型系统介绍<br/>2.3.3　ITS国际会议<br/>　2.4　ITS发展思考<br/>2.4.1 计算机在教育中的应用趋势分析<br/>2.4.2　ITS学科发展<br/>第三章　领域无关的多媒体ITS<br/>3.1　问题的提出<br/>3.2　研究思路和研究方法<br/>3.2.1　机器翻译与ITS <br/>3.2.2　SC文法体系<br/>3.2.3　智能多代理系统<br/>3.3　基于SC文法的知识表示<br/>3.4　系统智能结构设计<br/>3.5　智能推理机制<br/>3.6　基于sc文法的代理解释器运行实例<br/>3.7　智能教学策略<br/>3.8　智能知识组织<br/>3.9　教学实例分析<br/>3.9.1　知识树和知识森林<br/>3.9.2　知识点层次索引<br/>3.9.3　知识点属性标注<br/>3.9.4　知识树映射<br/>3.9.5　教学过程智能控制<br/>3.10　结论与展望<br/>第四章　ITS开发环境<br/>4.1　要点探索<br/>4.2　平台主要内容介绍<br/>4.3 系统开发平台的面向对象的方法<br/>4.3.1　知识点对象<br/>4.3.2　知识树类<br/>4.3.3　教学策略对象<br/>4.3.4　规则对象<br/>4.3.5　情景案例对象<br/>4.4　库命令函数<br/>4.5　平台实现<br/>4.6　平台的通用性和扩展性<br/>……<br/>第五章　学生建模　<br/>第六章　教师模型与错误诊断<br/>第七章　MIMBATS总体设计　<br/>第八章　MIMBATS领域专家<br/>第九章　多媒体智能人机接口<br/>第十章　基于VRML的网络情景模拟教学<br/>第十一章　VRML常用建模方法<br/>第十二章　网络协同式智能教学<br/>第十三章　国内外部分网络教育平台介绍<br/>第十四章　E-Chinese汉语网平台的设计与实现<br/>第十五章　结论与展望<br/>参考文献<br/>]]></description>
		</item>
		
			<item>
			<link>http://www.luweixmu.com/home/article.asp?id=164</link>
			<title><![CDATA[新书推荐：留学生汉语输出学习策略研究]]></title>
			<author>luwig@xmu.edu.cn(admin)</author>
			<category><![CDATA[对外汉语]]></category>
			<pubDate>Sat,03 Apr 2010 21:48:52 +0800</pubDate>
			<guid>http://www.luweixmu.com/home/default.asp?id=164</guid>
		<description><![CDATA[留学生汉语输出学习策略研究<br/>作者：钱玉莲<br/>出版社：世界图书出版公司北京公司<br/>出版日期：2009年12月<br/>ISBN：978-7-5100-1160-3<br/>页数：205<br/>开本：16开<br/>版次：一版一次<br/>装祯方式：平装<br/>开本：16<br/>原价：26.00<br/>目录<br/>绪论<br/>第一节 国内外研究现状述评<br/>第二节 输出学习策略研究的意义<br/>第三节 研究的目标和主要内容<br/>第四节 研究的基本思路和方法<br/><br/>第一章 汉语输出学习策略研究的理论基础<br/>第一节 语言输出假设研究回顾<br/>第二节 关于输出的功能<br/>第三节 输出假设与二语习得<br/><br/>第二章 汉语输出学习策略量表的制作<br/>第一节 基于教学的第二语言学习策略分类体系<br/>第二节 汉语口头输出学习策略量表的制作<br/>第三节 汉语书面输出学习策略量表的制作<br/><br/>第三章 留学生汉语口头输出学习策略研究<br/>第一节 研究目标<br/>第二节 研究方法<br/>第三节 研究结果<br/>第四节 讨论与结论<br/><br/>第四章 留学生汉语书面输出学习策略研究<br/>第一节 研究目标<br/>第二节 研究方法<br/>第三节 研究结果<br/>第四节 讨论与结论<br/><br/>第五章 中外学生汉语书面输出学习策略比较研究<br/>第一节 研究目标<br/>第二节 研究方法<br/>第三节 研究结果<br/>第四节 讨论与结论<br/><br/>第六章 对外汉语输出技能教学对策研究<br/>第一节 对外汉语输出技能教学现状分析<br/>第二节 对外汉语口语教学对策<br/>第三节 对外汉语写作教学对策<br/>第四节 对外汉语输出学习策略培训<br/>结语<br/>附录<br/>参考文献<br/>后记<br/><br/>作者介绍<br/>&nbsp;&nbsp;&nbsp;&nbsp;钱玉莲，女，汉族，出生于江苏昆山，文学博士。现为南京师范大学国际文化教育学院教授，对外汉语专业语言习得理论与应用方向硕士生导师。<br/>&nbsp;&nbsp;&nbsp;&nbsp;从事对外汉语教学与研究工作近20年。1997年2月至1999年2月受教育部委派到前南斯拉夫贝尔格莱德大学语言学院担任汉语教师，2002年9月至2003年7月作为交换教授到韩国淑明女子大学教授本科生汉语课程和研究生课程，2004年4月受国家汉办委派到印度尼西亚做少儿汉语水平考试预测（YCT）的督考工作，2004年11月至12月受国务院侨务办公室委派到美国纽约、华盛顿、洛杉矶、芝加哥等地执行中文师资培训任务。<br/>&nbsp;&nbsp;&nbsp;&nbsp;主要研究兴趣为对外汉语教学研究、汉语第二语言习得研究、民俗文化研究。已出版专著4部，在国内外学术期刊上发表论文20多篇，参编或主编并出版汉语语言教材4部，中国历史教材1部。]]></description>
		</item>
		
			<item>
			<link>http://www.luweixmu.com/home/article.asp?id=163</link>
			<title><![CDATA[新书推荐：新思域下的汉语课堂：“以学生为中心”的对外汉语教学探索]]></title>
			<author>luwig@xmu.edu.cn(admin)</author>
			<category><![CDATA[对外汉语]]></category>
			<pubDate>Sat,03 Apr 2010 21:33:22 +0800</pubDate>
			<guid>http://www.luweixmu.com/home/default.asp?id=163</guid>
		<description><![CDATA[新思域下的汉语课堂：“以学生为中心”的对外汉语教学探索<br/>丛书名：文治堂学术专著<br/>编者：李柏令<br/>出版社：上海交通大学出版社<br/>页码：202页<br/>出版日期：2010年02月<br/>ISBN：7313061595/9787313061591<br/>版本：第1版<br/>装帧：平装<br/>开本：16<br/>正文语种：中文<br/>内容简介<br/>&nbsp;&nbsp;&nbsp;&nbsp;《新思域下的汉语课堂：“以学生为中心”的对外汉语教学探索》围绕着“以学生为中心”的主题概念，分别从对外汉语教学的总体设计、教材编写、教学模式与方法以及测试评估等几大方面着手探讨研究。同时结合当今对外汉语教学的发展现状和存在问题，探索一条全新的对外汉语教学之路。 <br/>序言<br/>&nbsp;&nbsp;&nbsp;&nbsp;作为国家的一个重要职能机构，现代的大学承担着专业教学与培养人才、学术研究与实验、文化传承与创新等内在功能。大学教育的实施者大学教师，除了必须履行教学职责，还需要相应承担科学研究的任务，而科学研究体现于人文社科领域的教师，则是著书立说，将政治、经济、文化、社会等作为研究对象进行思考、解析并阐释。<br/>&nbsp;&nbsp;&nbsp;&nbsp;一所大学的学术精神和灵魂与它所处的民族文化氛围息息相关。在现代意义的大学在中国兴起之前，处于社会阶层“四民”之首的“士”的价值选择一般都遵循着自孔子以来所树立的传统，要求谨守入则孝、出则悌等基本准则，在不断的践行中到达内圣外王的圆满境界，至于为学，则是所谓“行有余力，则以学文”，并不作为首要与最高要求。因道不行于诸侯，孔子述而不作，删订六经，为后世学人开创了绵延不绝的道统与学术传统。以今天的学术分类来看，自汉代起的对中国经典的接受、解读与各种诠释，无论是“我注六经”还是“六经注我”的方式，都可以算作是人文社科领域意义上的学术研究与探索。这种接受与研究经典的过程与行为，不同于后来纯粹的不掺杂价值判断的为学问而为学，它也不是为晋升官僚阶层而必须掌握的敲门砖，因为这些原典所包含的观念、规制、原理、政事、风俗、仪式、情感，等等，关乎整个国家之建构、天下之治乱，进入了这些原典的世界，就意味着为学是以天下治乱为己任，因而，无论是两汉经学还是宋明理学中的名臣、大儒或学者，都一方面是“博学于文”，另一方面是“行己有耻”，重风议、严取与，不在事功中安身立命，则在讲学中存亡继绝。在他们身上，为学与讲学、道统与学统、教育与学问修养合而为一，所以，过去对师者的定位是“传道、授业、解惑也”，并不对个人的学术精深程度作明确要求，为学或研究过程和所为之学术是追寻的手段或工具，“道”才是终极目的。<br/>&nbsp;&nbsp;&nbsp;&nbsp;中国现代大学在近代的兴起一方面是西方的刺激影响，另一方面则可以看作中国传统学术与教育自身的发展道路前进的可能结果，此时对传统学术的研究总结与重新阐释，带来的不仅是社会的变革，也是思想、教育的转型与创新。 <br/>目录<br/>绪论“以学生为中心”的对外汉语教学<br/>0.1 “以学生为中心”的教育理念及其发展<br/>0.1.1 “以学生为中心”的教育理念<br/>0.1.2 “以学生为中心”的教育理念在西方的发展<br/>0.1.3 “以学生为中心”的教育理念在我国的发展<br/>0.2 “以学生为中心”的语言教育观<br/>0.2.1 “以学生为中心”的语言教育观的发展<br/>0.2.2 “以学生为中心”的语言教育观在中国<br/>0.3 “以学生为中心”的汉语第二语言教育观<br/>0.3.1 “以学生为中心”的对外汉语教育观<br/>0.3.2 “以学生为中心”的少数民族汉语教学研究<br/>0.3.3 “以学生为中心”的对外汉语教学研究现状<br/>0.4 “以学生为中心”的对外汉语教学和研究的意义<br/>0.4.1 确立“以学生为中心”的教育思想的主流地位<br/>0.4.2 填补“以学生为中心”的相关理论空白<br/>0.4.3 运用“以学生为中心”的思想指导教学<br/>0.5 “以学生为中心”的对外汉语教学研究的内容<br/>0.5.1 “以学生为中心”的对外汉语教学理论研究<br/>0.5.2 “以学生为中心”的对外汉语教学实践研究<br/>0.6 建构主义学习理论与对外汉语教学<br/>0.6.1 关于建构主义学习理论<br/>0.6.2 建构主义的对外汉语教学模式与教学方法<br/>0.6.3 建构主义学习环境下的对外汉语教学设计<br/><br/>第一章 “以学生为中心”的对外汉语教学总体设计<br/>1.1 基于需求分析的对外汉语教学总体设计<br/>1.1.1 “以学生为中心”的课程设计与需求分析<br/>1.1.2 教学对象分析：总体设计的起点<br/>1.1.3 信息采集与分班原则<br/>1.2 零起点商务汉语课程设计<br/>1.2.1 关于零起点商务汉语教学的研究<br/>1.2.2 否定零起点商务汉语教学的“基础论”观点的若干弊端<br/>1.2.3 从商务汉语的本质看零起点商务汉语教学的理论基础<br/>1.2.4 开展零起点商务汉语教学的思路和基本观点<br/>1.3 海外汉语教学课程总体设计研究<br/>1.3.1 从市场细分角度进行海外汉语教学课程研究与设计<br/>1.3.2 汉语消费品与汉语市场营销<br/>1.3.3 海外汉语教学情况的具体分析与对比<br/>1.3.4 “以学生为中心”的需求组合型对外汉语教学设计<br/><br/>第二章 “以学生为中心”的对外汉语教材编写<br/>2.1 为英语背景来华留学生编写汉语初级教材的新思路<br/>2.1.1 英语背景学生学习汉语的特点和现行汉语教材中的问题<br/>2.1.2 “以学生为中心”的教材编写原则和实践<br/>2.2 面向海外市场的汉语短期教材编写实践<br/>2.2.1 针对马来西亚的短期教材《生活汉语》<br/>2.2.2 《生活汉语》的编写原则<br/>2.3 基于语体的对外汉语中高级精读教材编写研究<br/>2.3.1 对中高级精读教材语体要素的调查与分析<br/>2.3.2 关于中高级精读教材编写的设想<br/>2.3.3 语体色彩的标注、讲解和辨析<br/>2.3.4 语体技能的练习<br/>2.3.5 语体知识的系统介绍<br/>2.4 初中级对外汉语教材中同译词现象考察与研究<br/>2.4.1 什么是“同译词<br/>2.4.2 与同译词现象相关的应用研究<br/>2.4.3 考察范围与研究方法<br/>2.4.4 同译词考察、分类及原因分析<br/>2.4.5 同译词对词语教学与学习的影响<br/>2.4.6 关于解决同译词问题的思考<br/><br/>第三章 “以学生为中心”的对外汉语教学模式和教学方法<br/>3.1 基于内容的多媒体汉语教学研究<br/>3.1.1 “基于内容的多媒体语言教学<br/>3.1.2 基于内容的多媒体汉语教学<br/>3.2 中级水平日韩、欧美留学生学习策略研究<br/>3.2.1 有关留学生学习策略的一项研究<br/>3.2.2 实验结果与讨论<br/>3.2.3 对教学的启示和建议<br/>3.3 对外汉语教学中的“讨论式教学”<br/>3.3.1 对外汉语教学中进行“讨论式教学”的必要性<br/>3.3.2 “讨论式教学”的讨论形式<br/>3.3.3 教师在实行“讨论式教学”活动中的主导作用<br/>3.4 “以学生为中心”的汉语个别教学案例研究<br/>3.4.1 一对一个别教学是对外汉语教学的客观需要<br/>3.4.2 一对一教学的设计与实验<br/><br/>第四章 “以学生为中心”的对外汉语教学测试与评估<br/>4.1 “以学生为中心”的语言测试理念<br/>4.1.1 “以学生为中心”的语言测试<br/>4.1.2 “以学生为中心”的汉语测试<br/>4.2 “以学生为中心”的汉语口语测试<br/>4.2.1 测试的原则<br/>4.2.2 初、中、高级汉语口语试卷分析<br/>4.2.3 评分的标准<br/>附录]]></description>
		</item>
		
			<item>
			<link>http://www.luweixmu.com/home/article.asp?id=162</link>
			<title><![CDATA[新书推荐：基于语料库的英语语言学语体分析]]></title>
			<author>luwig@xmu.edu.cn(admin)</author>
			<category><![CDATA[语言学]]></category>
			<pubDate>Sat,03 Apr 2010 21:29:42 +0800</pubDate>
			<guid>http://www.luweixmu.com/home/default.asp?id=162</guid>
		<description><![CDATA[基于语料库的英语语言学语体分析<br/>作者：桂诗春<br/>出版社：外语教学与研究出版社<br/>出版日期：2009-12<br/>ISBN：978-7-5600-9175-4<br/>页数：123<br/>开本：16开<br/>版次：一版一次<br/>装祯方式：平装<br/>内容提要：<br/>&nbsp;&nbsp;&nbsp;&nbsp;著名应用语言学家桂诗春教授集多年语言学教学与研究之经验，在本书中以“使用语料库和语料库过程来发现那些帮助我们弄懂语言怎样建立语篇的方式的语言型式”为主要目的，深入地分析了面向语言学和应用语言学教学的“英语语言学语料库”。<br/>&nbsp;&nbsp;&nbsp;&nbsp;研究通过100万词的语言学样本归纳出其语言特征，供我国英语教师和研究生阅读语言学著作和撰写语言学论文作参考。并附有光盘，包含词频排列表、词频分布表、超用词和少用词表等。<br/>&nbsp;&nbsp;&nbsp;&nbsp;本书是语言学及应用语言学专业的一线教师、研究生、语料库研究者、语篇研究者等不可多得的参考资料。 <br/>作者简介：<br/>&nbsp;&nbsp;&nbsp;&nbsp;桂诗春，广东外语外贸大学外国语言学与应用语言学研究中心教授、博士生导师，扭任中国外语教学研究会副会长、广东外语学会会长等职务。历任国务院学位委员会第二、三届学科评议组成员，广东省第五、六、七届人民代表大会代表，中山大学和广州外国语学院系副主任、系主任、院长等职。 主要研究方向为应用语言学、心理语言学、语料库语言学、语言测试等，著有《心理语言学》、《标准化考试——理论、原則与方法》、《应用语言学》、《应用语言学与中国英语教学》、《实验心理语音学纲要》、《中国学生英语学习心理》、《语言学方法论》(合著)、《新编心理语言学》、《应用语言学研究》、《中国学习者英语语料库》(合著)、《基于CLEC语料库的中国学习者英语分析》(合编)等。 <br/>目录：<br/>前言<br/>1 研究背景<br/>2 研究目的与方法<br/>2.1 基本统计分析<br/>2.2 多特征／多维度方法<br/>2.3 关键性方法<br/>3 基本统计分析<br/>3.1 词频分析<br/>3.2 语料库的词汇密度<br/>3.3 平均词长<br/>3.4 覆盖面<br/>3.5 罕用词<br/>3.6 句子长度<br/>3.7 词类<br/>3.8 小结<br/>4 语法特征分析<br/>4.1 语法特征<br/>4.2 因子分析<br/>4.3 词汇语法<br/>4.3.1 名词化<br/>4.3.2 名词<br/>4.3.3 现在时<br/>4.3.4 被动式<br/>4.3.5 过去分词省略Wh-式<br/>4.3.6 介词<br/>4.3.7 连接式<br/>4.3.8 修饰方式<br/>4.3.9 分裂辅助词<br/>4.3.10 无人称<br/>4.3.11 情态词<br/>4.4 小结<br/>5 词汇特征分析<br/>5.1超用词的特点<br/>5.1.1 名词居多<br/>5.1.2 功能词的使用<br/>5.1.3 凸显研究焦点<br/>5.2 语族<br/>5.2.1 语族的分布<br/>5.2.2 常用的语族<br/>5.3 搭配分析<br/>5.3.1 两条原则<br/>5.3.2 几个例子<br/>5.4 小结<br/>5.5 语言学语体的词汇特征<br/>5.5.1 定义性语言<br/>5.5.2 分类性语言<br/>5.5.3 分析性语言<br/>5.5.4 修饰性语言<br/>5.5.5 词汇包<br/>5.6 专用词汇表的特点<br/>5.6.1 名词和派生形容词<br/>5.6.2 功能词<br/>5.6.3 专有名词<br/>5.6.4 通用性词汇<br/>5.7 少用词<br/>5.7.1 人称代词<br/>5.7.2 动词过去时态<br/>5.7.3 缩约语<br/>5.7.4 和时间、年、月有关的词语<br/>5.7.5 和个人生活有关的词语<br/>5.7.6 和社会生活有关的词语<br/>5.8 小结<br/>6 研究结论与教学应用<br/>6.1 研究结论<br/>6.2 在教学中的应用<br/>参考文献]]></description>
		</item>
		
			<item>
			<link>http://www.luweixmu.com/home/article.asp?id=161</link>
			<title><![CDATA[新书推荐：英语学习者口语交际能力研究——语料库语言学视角]]></title>
			<author>luwig@xmu.edu.cn(admin)</author>
			<category><![CDATA[语言学]]></category>
			<pubDate>Sat,03 Apr 2010 21:26:25 +0800</pubDate>
			<guid>http://www.luweixmu.com/home/default.asp?id=161</guid>
		<description><![CDATA[英语学习者口语交际能力研究——语料库语言学视角<br/>作者：甄凤超<br/>出版社：上海交通大学出版社<br/>出版日期：2009-10<br/>ISBN：978-7-313-05575-0<br/>页数：270<br/>开本：16开<br/>版次：一版一次<br/>装祯方式：平装<br/>内容提要：<br/>&nbsp;&nbsp;&nbsp;&nbsp;本书从语料库语言学角度，着重探讨了中国学习者英语口语交际能力的四个重要成分：使用预构成语块的能力、通过目的语运用图式知识的能力、语用能力和策略能力。基于“大学学习者英语口语语料库”，本书从上述四个纬度对比分析了不同口语交际能力水平的学习者之间及与英语母语者之间的差异，发现中国学习者英语口语的特征和问题。书中所提出的观点、概念、方法以及研究发现为学习者英语研究、二语习得研究、外语教学研究提供了有价值的信息。 <br/>目录：<br/>Chapter 1 Introduction<br/>1.1 Motivations for the book<br/>1.2 Research objectives<br/>1.3 Outline of the book<br/>Chapter 2 Critical Discussion of the Previous Studies on Learner Language<br/>2.1 Changes in current linguistic and SLA research<br/>2.2 Main features of CLC data<br/>2.3 Methodological approaches to CLC research<br/>2.4 Studies based on learner corpora<br/>2.5 Conversation analysis<br/>2.6 Communicative competence<br/>2.7 Conclusion<br/>Chapter 3 Defining o&#114;al Communicative Ability<br/>3.1 Features of o&#114;al communication<br/>3.2 Accuracy, fluency and appropriacy<br/>3.3 Four essential components<br/>3.4 Conclusion<br/>Chapter 4 Research Methodology<br/>4.1 The COLSEC<br/>4.2 The data in the book<br/>4.3 Methods for data extraction and processing<br/>4.4 Summary<br/>Chapter 5 Findings and Analysis (1): Productive Vocabulary Size and the Ability to Use PCs<br/>5.1 Productive vocabulary size<br/>5.2 The ability to use PCs<br/>5.3 Summary<br/>Chapter 6 Findings and Analysis (2): the Ability to Manipulate Schemata<br/>6.1 Introduction<br/>6.2 Key words for the Use of Computers<br/>6.3 Key words for Going on Tour<br/>6.4 Discussion and conclusion<br/>Chapter 7 Findings and Analysis (3): Pragmatic Competence and Strategic Competence<br/>7.1 Pragmatic markers<br/>7.2 Interruptions and overlaps<br/>7.3 Strategies for taking, holding and yielding turns<br/>7.4 Reactive tokens<br/>7.5 Summary<br/>Chapter 8 Conclusion<br/>8.1 Summary of findings<br/>8.2 Implications for pedagogy<br/>8.3 Limitations of the book<br/>8.4 Directions for further researches<br/>Appendices<br/>Appendix A An error classification system<br/>Appendix B Error types in three groups<br/>Appendix C Collocates of IMPROVE in the position of n in the pattern V n from the SBNC<br/>Appendix D Collocates of KNOWLEDGE in the pattern v N from the SBNC]]></description>
		</item>
		
			<item>
			<link>http://www.luweixmu.com/home/article.asp?id=160</link>
			<title><![CDATA[新书推荐：系统与语料：二者关联探索]]></title>
			<author>luwig@xmu.edu.cn(admin)</author>
			<category><![CDATA[语言学]]></category>
			<pubDate>Sat,03 Apr 2010 21:23:07 +0800</pubDate>
			<guid>http://www.luweixmu.com/home/default.asp?id=160</guid>
		<description><![CDATA[系统与语料：二者关联探索（英文影印版）<br/>原书名： System And Corpus: Exploring Connections <br/>原出版社： David Brown Book Company <br/>作者： (英)Geoff Thompson,&nbsp;&nbsp;Susan Hunston&nbsp;&nbsp;<br/>丛书名： 西方语言学前沿书系 <br/>出版社：世界图书出版公司 <br/>ISBN：9787510014185 <br/>定价 ： ￥45.00<br/>出版日期：2010 年1月 <br/>开本：24 <br/>页码：326 <br/>版次：1-1<br/>&nbsp;&nbsp;&nbsp;&nbsp;本书所收论文从不同角度探讨了系统功能语言学与语料库语言学的协同关系，从中我们可以看到当前国外语言学研究中对这个重要问题的思考。论文来自英国、德国、意大利、澳大利亚、奥地利、日本等国从事系统功能语言学和语料库语言学研究的专家，系统功能语言学的代表人物M．A．K．Hamday为全书写了后记。<br/>&nbsp;&nbsp;&nbsp;&nbsp;本书适合从事系统功能语言学和语料库语言学研究，以及对二者感兴趣的相关人士阅读。 <br/>本书提供作译者介绍<br/>&nbsp;&nbsp;&nbsp;&nbsp;Geoff Thompson任教于英国利物浦大学，曾在中国等多个国家教授英语及应用语言学课程。他的主要著作有Introducing Functional Grammar(《功能语言学入门》)等。<br/>&nbsp;&nbsp;&nbsp;&nbsp;Susan Hunston是英国伯明翰大学的教授，具有丰富的语言教学和应用语言学研究经验，是语料库分析领域公认的专家。她的主要著作有Corpora in Applied Linguistics(《应用语言学中的语料库》)等。<br/>目录<br/>《语料库与计算语言学研究丛书》序 <br/>《系统与语料——二者关联探索》导读 <br/>原书目录 <br/>前言 <br/>供稿者名录 <br/>1 导论 <br/>系统与语料库：建立在共同基础上的两个传统 <br/>2 语料分析：学科现状与未回答问题的三个类型 <br/>3 作为选择的语言：什么是选择? <br/>4 熟语选词与系统：对于争论的一个贡献 <br/>5 在语料中注入系统：论语料库与系统功能语言学的关系 <br/>6 某些基本语法系统的频度侧面：中期研究报告 <br/>7 语料数据与系统网络的“匹配”：使用语料库修改并扩充英语的“及物性”网络 <br/>8 多模态语料库语言学 <br/>9 在系统功能语言学中怎样处理词汇语义：基于语料库的范围形容词意图研究 <br/>10 日语中“痛苦”的表达方法 <br/>11 基于语料库的泰语连接词：研究：文本资源的利用 <br/>12 从索引行到文本：在小型语料库Alma Mater关于赠送的提问中对于giving的评价 <br/>13 旅游网页中与文化相关的语言差异：激情与事实，在Appraisal框架内的语料分析 <br/>14 后记 <br/>参考文献 <br/>索引 ]]></description>
		</item>
		
			<item>
			<link>http://www.luweixmu.com/home/article.asp?id=159</link>
			<title><![CDATA[对外汉语多媒体教学课件设计---教学大纲]]></title>
			<author>luwig@xmu.edu.cn(admin)</author>
			<category><![CDATA[讲授课程]]></category>
			<pubDate>Mon,22 Feb 2010 18:02:21 +0800</pubDate>
			<guid>http://www.luweixmu.com/home/default.asp?id=159</guid>
		<description><![CDATA[课程名称：对外汉语多媒体教学课件设计<br/>英文名称：Designing Multimedia Courseware for TCSOL<br/>教学对象：2009级汉语国际教育硕士（海外生）<br/>开课学期：2009-2010学年春季学期<br/>周学时/总学时：2/34<br/>学分：2<br/>教学方式：课堂讲授、课件演示与设计例释<br/>考试方式：课件设计与制作（包括课件及其设计报告）<br/>内容提要：<br/>&nbsp;&nbsp;&nbsp;&nbsp;本课程的主要教学内容包括汉语多媒体教学课件设计的基本概念和基础知识、汉语多媒体教学课件制作的常用技术、汉语多媒体教学课件设计解析、汉语多媒体教学课件资源及应用等。<br/>&nbsp;&nbsp;&nbsp;&nbsp;The main contents include some basic concepts and rudiments of multimedia courseware design for TCSOL, commonly-used technologies in multimedia courseware development, exemplification and demonstration of TCSOL multimedia courseware design, and resources and application of TCSOL multimedia courseware.<br/>教学目标：<br/>&nbsp;&nbsp;&nbsp;&nbsp;本课程通过课堂讲授、课件演示与设计例释，帮助学生了解汉语多媒体教学课件设计的基本概念和基础知识，着重帮助学生掌握汉语多媒体教学课件制作的常用技术和技巧，培养学生设计和制作汉语多媒体课件辅助汉语教学的实际能力。<br/>Objectives:<br/>&nbsp;&nbsp;&nbsp;&nbsp;Through lectures, demonstrations and exemplifications, this course aims at familiarizing students with some basic concepts and rudiments of multimedia courseware design for TCSOL, with focus on enabling them to have a good command of commonly-used technologies in multimedia courseware development, and cultivating their practical ability to design and develop multimedia courseware to assist Chinese teaching.<br/>教材：<br/>&nbsp;&nbsp;&nbsp;&nbsp;郑艳群编著，汉语多媒体教学课件设计，北京语言文化大学出版社2009年出版。<br/>参考书：<br/>郑艳群，计算机技术与世界汉语教学，外语教学与研究出版社2008年出版。<br/>崔永华编著，对外汉语教学设计导论，北京语言大学出版社2008年出版。<br/>胡隆主编，计算机辅助外语教学——多媒体和网络的应用，上海外语教育出版2001年出版。<br/>张森、宗绪锋编著，多媒体CAI课件基本原理与制作技术，北京航空航天大学出版社2000年出版。<br/>周恕义主编，多媒体CAI开发实用教程，中国水利水电出版社1999年出版。<br/>Heinich, Robert, et al. Instructional Media and Technologies for Learning (7th ed.). Pearson Education/Higher Education Press. 2002.]]></description>
		</item>
		
			<item>
			<link>http://www.luweixmu.com/home/article.asp?id=158</link>
			<title><![CDATA[New Publications of Linguistic Data Consortium]]></title>
			<author>luwig@xmu.edu.cn(admin)</author>
			<category><![CDATA[语言学]]></category>
			<pubDate>Tue,26 Jan 2010 15:28:41 +0800</pubDate>
			<guid>http://www.luweixmu.com/home/default.asp?id=158</guid>
		<description><![CDATA[In this newsletter:<br/>- Newly Expanded Press Release Section - <br/>- Upcoming LDC Institute Seminar -<br/>New Publications:<br/>LDC2010T02<br/>- Czech Broadcast News MDE Transcripts -<br/>LDC2010T03<br/>- GALE Phase 1 Chinese Newsgroup Parallel Text - Part 2 -<br/>LDC2010T01<br/>- NIST Open Machine Translation 2008 Evaluation (MT08) Sel&#101;cted Reference and System Translations -<br/>--------------------------------------------------------------------------------<br/>Newly Expanded Press Release Section<br/><br/>Recall reading a newsletter article about the Reduced Licensing Fee but unsure what you did with the email?&nbsp;&nbsp; Curious as to which o&#114;ganization was the recipient of LDC&#39;s 15,000th corpus distribution nearly eight years ago?&nbsp;&nbsp;If so, be sure to visit LDC&#39;s newly expanded Press Release section on our What&#39;s New! What&#39;s Free! page to read about these topics and more.&nbsp;&nbsp;The Press Release section includes the articles of previous newsletters as well as major announcements from LDC.&nbsp;&nbsp;Information is o&#114;ganized into the following categories:<br/><br/>15th Anniversary Monthly Spotlight Archive - as part of our 15th Anniversary celebration in 2007, we highlighted one aspect of the LDC in our monthly newsletters. These features provided our members and data users with a glimpse of the broad range of LDC’s research activities. <br/><br/>Conference Attendance by LDC - recent publisher displays and conference participation by LDC.<br/><br/>Etc. - recent collaborations and grant awards plus other announcements.<br/><br/>Membership Mailbag Archive - to address the questions that our data users have asked, we introduced our Membership Mailbag series of newsletter articles in May 2008. This periodic series addresses frequently asked questions about LDC data, the LDC Intranet, and the benefits of an LDC membership.<br/><br/>Member Surveys - LDC conducted two end-of-year surveys to obtain feedback on satisfaction levels with LDC Membership and data releases as well as our corpus catalog, and to gather suggestions on future publications.<br/><br/>Milestones and Celebrations - information on our landmark corpora distributions and events to celebrate our 10th and 15th anniversary years.<br/><br/>Use of LDC Corpora in University Summer Schools - ways LDC corpora have been used for teaching purposes at university summer school programs.<br/><br/>The Press Release section will be up&#100;ated as new announcements are made so we anticipate that this will be a great resource for information about LDC.<br/><br/>- Upcoming LDC Institute Seminar -<br/><br/>The LDC Institute will hold its next session on&nbsp;&nbsp;Tuesday, January 26, 2010, from 10:00 a.m. to 12:00 p.m.&nbsp;&nbsp;in the LDC Conference Room at LDC&#39;s Philadelphia offices, 3600 Market Street, Suite 810. <br/><br/>The topic of this session will be the U.S. Supreme Court Corpus (SCOTUS) presented by Daniel Katz, J.D., M.P.P., Fellow in Empirical Legal Studies, Michigan Law School, PhD Candidate, Political Science and Public Policy, University of Michigan, and Michael Bommarito, PhD Student, Political Science: Methods &amp; Modeling, University of Michigan. <br/><br/>ABSTRACT: <br/>The corpus of Supreme Court written opinions is a rich linguistic resource. Not only does this corpus provide a longitudinal sample of formal American English, but it is also a source of text with identified authors and vote-coded sentiment. Despite this value and years of qualitative and quantitative material of the United States Supreme Court, no compiled corpus of these opinions is currently available to researchers. The purpose of this talk is (1) to describe efforts to compile both the complete corpus of Supreme Court Opinions and associated metadata, (2) to outline a number of our current research projects utilizing this data, and (3) to discuss any criticism, potential projects, o&#114; possible collaboration. <br/><br/>Refreshments will be provided. If you are in the area, we hope to see you there! <br/><br/>New Publications<br/><br/>(1)Czech Broadcast News MDE Transcripts was prepared by researchers at the University of West Bohemia, Pilsen, Czech Republic. It consists of metadata extraction (MDE) annotations for the approximately 26 hours of transcribed broadcast news speech in Czech Broadcast News Transcripts (LDC2004T01). The audio files corresponding to the transcripts in this corpus are contained in Czech Broadcast News Speech (LDC2004S01). Czech Broadcast News MDE Transcripts joins LDC&#39;s other holdings of Czech broadcast data: Czech Broadcast Conversation Speech (LDC2009S02), Czech Broadcast Conversation MDE Transcripts (LDC2009T20), Voice of America (VOA) Czech Broadcast News Audio (LDC2000S89) and Voice of America (VOA) Czech Broadcast News Transcripts (LDC2000T53). <br/><br/>The audio recordings were collected from February 1, 2000 through April 22, 2000 from three Czech radio stations and two television stations. The broadcasts included both public and commercial subjects and were presented in various styles, ranging from a formal style to a colloquial style more typical for commercial broadcast companies that do not primarily focus on news. <br/><br/>The goal of MDE research is to take raw speech recognition output and refine it into forms that are of more use to humans and to downstream automatic processes. In simple terms, this means the creation of automatic transcripts that are maximally readable. This readability might be achieved in a number of ways: removing non-content words like filled pauses and discourse markers from the text; removing sections of disfluent speech; and creating boundaries between natural breakpoints in the flow of speech so that each sentence o&#114; other meaningful unit of speech might be presented on a separate line within the resulting transcript. Natural capitalization, punctuation, standardized spelling and sensible conventions for representing speaker turns and identity are further elements in the readable transcript. <br/><br/>The transcripts and annotations in this corpus are stored in two formats: QAn (Quick Annotator), and RTTM. Character encoding in all files is ISO-8859-2.<br/><br/>Czech Broadcast News MDE Transcripts is distributed via web download.<br/><br/>2010 Subscription Members will automatically receive two copies of this corpus on disc. 2010 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for US$750.<br/><br/>(2) GALE Phase 1 Chinese Newsgroup Parallel Text - Part 2 was prepared by LDC and contains 223,000 characters (98 files) of Chinese newsgroup text and its translation sel&#101;cted from twenty-one sources. Newsgroups consist of posts to electronic bulletin boards, Usenet newsgroups, discussion groups and similar forums. This release was used as training data in Phase 1 (year 1) of the DARPA-funded GALE program. <br/><br/>Preparing the source data involved four stages of work: data scouting, data harvesting, formating and data sel&#101;ction.<br/><br/>Data scouting involved manually searching the web for suitable newsgroup text. Data scouts were assigned particular topics and genres along with a production target in o&#114;der to focus their web search. Formal annotation guidelines and a customized annotation toolkit helped data scouts to manage the search process and to track progress. <br/><br/>Data scouts logged their decisions about potential text of interest to a database. A nightly process queried the annotation database and harvested all designated URLs. Whenever possible, the entire site was downloaded, not just the individual thread o&#114; post located by the data scout. Once the text was downloaded, its format was standardized so that the data could be more easily integrated into downstream annotation processes. Typically, a new script was required for each new domain name that was identified. After scripts were run, an optional manual process corrected any remaining formatting problems.<br/><br/>The sel&#101;cted documents were then reviewed for content-suitability using a semi-automatic process. A statistical approach was used to rank a document&#39;s relevance to a set of already-sel&#101;cted documents labeled as &#34;good.&#34; An annotator then reviewed the list of relevance-ranked documents and sel&#101;cted those which were suitable for a particular annotation task o&#114; for annotation in general. These newly-judged documents in turn provided additional input for the generation of new ranked lists. <br/><br/>Manual sentence units/segments (SU) annotation was also performed as part of the transcription task. Three types of end of sentence SU were identified: statement SU, question SU, and incomplete SU. After transcription and SU annotation, files were reformatted into a human-readable translation format and assigned to professional translators for careful translation. Translators followed LDC&#39;s GALE Translation guidelines which describe the makeup of the translation team, the source data format, the translation data format, best practices for translating certain linguistic features and quality control procedures applied to completed translations. <br/><br/>GALE Phase 1 Chinese Newsgroup Parallel Text - Part 2 is distributed via web download.<br/><br/>2010 Subscription Members will automatically receive two copies of this corpus on disc. 2010 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for US$1500.<br/><br/>(3) NIST Open Machine Translation 2008 Evaluation (MT08) Sel&#101;cted Reference and System Translations.&nbsp;&nbsp;NIST Open MT is an evaluation series to support research in, and help advance the state of the art of, technologies that translate text between human languages. Participants submit machine translation output of source language data to NIST (National Institute of Standards and Technology); the output is then evaluated with automatic and manual measures of quality against high quality human translations of the same source data. This program supports the growing interest in system combination approaches that generate improved translations from output of several different machine translation (MT) systems. MT system combination approaches require data sets composed of high-quality human reference translations and a variety of machine translations of the same text. The NIST Open Machine Translation 2008 Evaluation (MT08) Sel&#101;cted Reference and System Translations set addresses this need. <br/><br/>The data in this release consists of the human reference translations and corresponding machine translations for the NIST Open MT08 test sets, which consist of newswire and web data in the four MT08 language pairs:&nbsp;&nbsp;Arabic-to-English, Chinese-to-English, English-to-Chinese (newswire only) and Urdu-to-English. Two documents per language pair and genre were removed at random from the test sets for release. For the machine translations, only output from one submission per training condition (Constrained and Unconstrained training, wh&#101;re available) per participant is included. See section 2 of the MT08 Evaluation Plan for a description of the training conditions. The resulting data set has the following characteristics: <br/><br/>Arabic-to-English: 120 documents with 1312 segments, output from 17 machine translation systems. <br/>Chinese-to-English: 105 documents with 1312 segments, output from 23 machine translation systems. <br/>English-to-Chinese: 127 documents with 1830 segments, output from 11 machine translation systems. <br/>Urdu-to-English: 128 documents with 1794 segments, output from 12 machine translation systems. <br/>The data is o&#114;ganized and annotated in such a way that subsets for each language pair and/or data genre and/or training condition can be extracted and used separately, depending on the user&#39;s needs.<br/><br/>NIST Open Machine Translation 2008 Evaluation (MT08) Sel&#101;cted Reference and System Translations is distributed via web download.<br/><br/>2010 Subscription Members will automatically receive two copies of this corpus on disc. 2010 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for US$200.<br/>-------------------------------------------------------------------------------<br/>Ilya Ahtaridis<br/>Membership Coordinator<br/>--------------------------------------------------------------------<br/>Linguistic Data Consortium&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Phone: (215) 573-1275<br/>University of Pennsylvania&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Fax: (215) 573-2175<br/>3600 Market St., Suite 810&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; ldc@ldc.upenn.edu<br/> Philadelphia, PA 19104 USA&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <a href="http://www.ldc.upenn.edu" target="_blank" rel="external">http://www.ldc.upenn.edu</a>]]></description>
		</item>
		
</channel>
</rss>
