信息检索关键词部分

 

Key word

1

信息检索(Information Retrieval, IR      数据检索(data retrieval

相关性(relevance                      推送(Push                

超空间(hyperspace                    拉出(pulling

文献逻辑表示(视图)(logical view of the document

检索任务(retrieval task                   检索(retrieval

过滤(filtering                         全文本(full text

词干提取(stemming                    文本操作(text operation

标引词(indexing term                  信息检索策略(retrieval strategy

光学字符识别(Optical Character Recognition, OCR

跨语言(cross-language                  倒排文档(inverted file

检出文献(retrieved document            相关度(likelihood

信息检索的人机交互界面(human-computer interaction, HCI

检索模型与评价(Retrieval Model & Evaluation)文本图像(textual images 

界面与可视化(Interface & Visualization   书目系统(bibliographic system

多媒体建模与检索(Multimedia Modeling & Searching

数字图书馆(Digital Library              检索评价(retrieval evaluation

标准通用标记语言(Standard Generalized Markup Language, SGML

标引和检索(indexing and searching       导航(Navigation

并行和分布式信息检索(parallel and distribution IR

模型与查询语言(model and query language)导航(Navigation

有效标引与检索(efficient indexing and searching

 

2

特别检索(ad hoc retrieval     过滤(filtering     集合论(set theoretic             代数(algebraic                   概率(probabilistic     路由选择(routing

用户需求档(user profile           阙值(threshold      权值(weight 

语词加权(term-weighting          相似度(similarity   相异度(dissimilarity

域建模(domain modeling          叙词表(thesaurus   扁平(flat

广义向量空间模型(generalized vector space model          神经元(neuron    

潜语义标引模型(latent semantic indexing model            邻近结点(proximal node       

贝叶斯信任度网络(Bayesian belief network                结构导向(structure guided

结构化文本检索(structured text retrieval, STR          推理网络(inference network

扩展布尔模型(extended Boolean model              非重叠链表(non-overlapping list

 

3

检索性能评价(retrieval performance evaluation      会话(interactive session

查全率(R, Recall Ratio)                             信息性(Informativeness

查准率(P, Precision Ratio)                            面向用户(user-oriented

漏检率(O, Omission Ratio)                           新颖率(novelty ratio

误检率(M, Miss Ratio)                               用户负担(user effort

相对查全率(relative recall                         覆盖率(coverage ratio

参考测试集(reference test collection                优劣程度(goodness

查全率负担(recall effort                          主观性(subjectiveness 

信息性测度(informativeness measure

 

4

检索单元(retrieval unit     字母表(alphabet          分隔符(separator       

复合性(compositional      模糊布尔(fuzzy Boolean    模式(pattern

SQL(Structured Query Language, 结构化查询语言)   布尔查询(Boolean query   

参照(reference    半结合(semijoin            标签(tag                   

有序包含(ordered inclusion                      无序包含(unordered inclusion                                       

CCL(Common Command Language, 通用命令语言)    树包含(tree inclusion

布尔运算符(Boolean operator                    searching allowing errors容错查询

Structured Full-text                                relevance feedback 相关反馈

Query Language (SFQL) (结构化全文查询语言)     extended patterns扩展模式            

CD-RDx Compact Disk Read only Data exchange (CD-RDx)(只读磁盘数据交换)

WAIS (广域信息服务系统Wide Area Information Service) 

visual query languages. 查询语言的可视化               查询语法树(query syntax tree

 

5

query reformulation 查询重构 query expansion 查询扩展                                 term reweighting 语词重新加权                相似性叙词表(similarity thesaurus

User Relevance Feedback用户相关反馈         the graphical interfaces 图形化界面

簇(cluster   检索同义词(searchonym    local context analysis局部上下文分析

 

6

文献(document           样式(style       元数据(metadata

Descriptive Metadata 描述性元数据                Semantic Metadata 语义元数据

intellectual property rights 知识产权                content rating 内容等级

digital signatures数字签名                         privacy levels 权限

electronic commerce电子商务                     

都柏林核心元数据集(Dublin Core Metadata Element Set

通用标记语言(SGMLstandard general markup language                                

机读目录记录(Machine Readable Cataloging Record, MARC

资源描述框架(Resource Document Framework, RDF)                    XML(eXtensible Markup Language, 可扩展标记语言

HTMLHyperText Markup Language, 超文本标记语言)

Tagged Image File Format (TIFF标签图像文件格式)

Joint Photographic Experts Group (JPEG) Portable Network Graphics (PNG新型位图图像格式)

 

7

分隔符(separator                   连字符(hyphen

排除表(list of stopwords             词干提取(stemming

波特(porter                        词库(treasury of words

受控词汇表(controlled vocabulary     索引单元(indexing component

文本压缩text compression               压缩算法compression algorithm

注释(explanation                    统计方法(statistical method

赫夫曼(Huffman                    压缩比(compression ratio

数据加密Encryption                   半静态的(semi-static

词汇分析lexical analysis                排除停用词elimination of stopwords

 

8

半静态(semi-static191                   词汇表(vocabulary192           

事件表(occurrence192                   inverted files倒排文档     

suffix arrays后缀数组                      signature files签名档

块寻址(block addressing193              索引点(index point199

起始位置(beginning199                  Vocabulary search词汇表检索

Retrieval of occurrences 事件表检索          Manipulation of occurrences事件表操作

散列变换(hashing205                    误检(false drop205

查询语法树(query syntax tree207           布鲁特-福斯算法简称BFBrute-Force

故障(failure210    移位-或(shift-or    位并行处理(bit-parallelism212

顺序检索(sequential search220            原位(in-place227

 

9

并行计算(parallel computing           SISD (单指令流单数据流)

SIMD (单指令流多数据流)             MISD (多指令流单数据流)

MIMD (多指令流多数据流)            分布计算(distributed computing

颗粒度(granularity231                多任务(multitasking

I/Oinput/output233                  标引器(indexer

 映射(map233                      命中列表(hit-list

全局语词统计值(global term statistics   线程(thread

算术逻辑单元(arithmetic logic unit, ALU 中介器(broker

虚拟处理器(virtual processor240

分布式信息检索(distributed information retrieval)249

文献收集器(gatherer                 主中介器(central broker254

 

10

信息可视化(information visualization        图标(icon260

颜色凸出显示(color highlighting            焦点+背景(focus-plus-context

画笔和链接(brushing and linking           魔术透镜(magic lenses

移动镜头和调焦(panning and zooming       弹性窗口(elastic window

概述及细节信息(overview plus details        高亮色显示(highlight

信息存取任务(information access tasks       文献替代(document surrogate

常见问题(FAQ, Frequently Asked Question)      群体性推荐(social recommendation

上下文关键词(keyword-in-context, KWIC      伪相关反馈(pseudo-relevance feedback

重叠式窗口(overlapping window            工作集(working set

 

11/12

多媒体信息检索(Multimedia Information Retrieval, MIR 超类(superclass

半结构化数据(semi-structured data                     数据片(data blade

可扩充型系统(extensible type system                    相交(intersect

动态服务器(dynamic server                            叠加(overlaps

档案库服务器(archive server                           聚集(center

逻辑结构(logical structure                             词包含(contain word

例子中的查询(query by example                        路径名(path-name

通过图像内容查询(Query by Image Content, QBIC       图像标题(image header

主要成分分析(Principal Component Analysis, PCA       精确匹配(exact match

潜语义标引(Latent Semantic Indexing, LSI              基于内容(content-based

范围查寻(Range Query

 

13

exponential growth指数增长                Distributed data 数据的分布性  

volatile data 不稳定数据                   redundant data 冗余数据   

Heterogeneous data异构数据               分界点(cut point373 

Centralized Architecture集中式结构         收集器-标引器(crawler-indexer373

Wanderers 漫步者     Walkers 步行者     Knowbots 知识机器人

Distributed Architecture分布式结构         gatherers 收集器    

brokers 中介器                           the query interface 查询界面  

the answer interface响应界面               PageRank 网页级别   

Crawling the Web漫游Web                 breadth-first 广度优先   

depth-first fashion 深度优先                Indicesindex pl.)索引   

Web Directories 网络目录                  Metasearchers元搜索引擎 

Teaching the User用户培训                 颗粒度(granularity384   

超文本推导主题检索(Hypertext Included Topic Search, HITS380                              

Specific queries专指性查询                 Broad queries 泛指性查询

Vague queries模糊查询                    Searching using Hyperlinks使用超链接搜索

Web Query Languages查询语言             Dynamic Search 动态搜索  

Software Agents 软件代理鱼式搜索(fish search

鲨鱼搜索(shark search)拉出/推送(pull/push393  

门户(portal395                         Duplicated data 重复数据    

 

14

联机公共检索目录(online public access catalog, OPAC397

化学文摘(Chemical Abstract, CA399      生物学文摘(Biological Abstract, BA

工程索引(Engineering Index,EI

国会图书馆分类法(Library of Congress Classification408

杜威十进分类法(Dewey Decimal Classification408

联机计算机图书馆中心(Online Computer Library Center, OCLC409

机读目录记录(Machine Readable Cataloging Record, MARC409

 

15

NSF (National Science Foundation, 美国国家科学基金会)

NSNANational Aeronautics and Space Administration 美国航空航天局)

数字图书馆创新项目(Digital Libraries Initiative, DLI415

5Sstream,信息流structure,结构space, 空间scenario, 场景society社会)416

基于数字化对象标识符(Digital Object Identifier, DOI420

都柏林核心(Dublin Core, DC430       数字图书馆(Digital Library, DL

资源描述框架(Resource Document Framework, RDF)431

text encoding initiative (TEI) (文本编码创新项目)431

 

v
posted @ 2007-12-31 16:38  岩仔  阅读(1399)  评论(0编辑  收藏  举报