On Data Warehousing: Inmon's Corporate Information Factory (CIF) Overview

Original post | Category: Linux Operating Systems | Author: bq_wang | Posted: 2008-02-13 17:26:05

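Translating is always painful. When I read someone else's bad translation I quietly curse it, but only when I translate something myself do I understand how hard it is.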

As for DW2.0 and CIF, I have no strong intuition about either. In the end, a data warehouse is still a data warehouse.


Corporate Information Factory (CIF) Overview

The Corporate Information Factory and the Web Environment

CIF

Summary Descriptions

Operational Systems are the internal and external core systems that support the day-to-day business operations. They are accessed through application program interfaces (APIs) and are the source of data for the data warehouse and operational data store. (Encompasses all operational systems, including ERP, relational and legacy.)

Data Acquisition is the set of processes that capture, integrate, transform, cleanse, reengineer and load source data into the data warehouse and operational data store. Data reengineering is the process of investigating, standardizing and providing clean consolidated data.
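As a rough illustration of the capture, cleanse, and consolidate steps named above, here is a minimal Python sketch. The record fields, the cleansing rules, and the "last row wins" consolidation policy are all assumptions made for this example; the CIF description does not prescribe any of them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CustomerRecord:
    customer_id: int
    name: str
    country: str


def cleanse(raw: dict) -> CustomerRecord:
    """Standardize one captured source row (hypothetical rules)."""
    return CustomerRecord(
        customer_id=int(raw["id"]),
        # Collapse repeated whitespace and normalize capitalization.
        name=" ".join(raw["name"].split()).title(),
        country=raw["country"].strip().upper(),
    )


def acquire(source_rows: list) -> list:
    """Cleanse rows and consolidate duplicates by id (the last row wins)."""
    consolidated = {}
    for raw in source_rows:
        record = cleanse(raw)
        consolidated[record.customer_id] = record
    return sorted(consolidated.values(), key=lambda r: r.customer_id)


rows = [
    {"id": "2", "name": "  bob   jones ", "country": "us "},
    {"id": "1", "name": "alice SMITH", "country": " de"},
    {"id": "2", "name": "Bob Jones", "country": "US"},
]
warehouse = acquire(rows)
```

Here `warehouse` ends up holding one clean record per customer, which is the "clean consolidated data" the definition refers to.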

The Data Warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data used to support the strategic decision-making process for the enterprise. It is the central point of data integration for business intelligence and is the source of data for the data marts, delivering a common view of enterprise data.
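The terms "time-variant" and "non-volatile" can be made concrete with a small sketch: rows are only ever appended with a snapshot date, and queries ask what was known as of a given date. The `Warehouse` class, its method names, and the sample key are hypothetical, chosen only for illustration.

```python
from datetime import date


class Warehouse:
    """Append-only store: each load adds a dated snapshot; nothing is updated in place."""

    def __init__(self):
        self._rows = []  # list of (snapshot_date, key, value)

    def load(self, snapshot, key, value):
        self._rows.append((snapshot, key, value))

    def as_of(self, key, when):
        """Return the value of `key` from the latest snapshot on or before `when`."""
        matches = [(s, v) for s, k, v in self._rows if k == key and s <= when]
        if not matches:
            return None
        return max(matches, key=lambda pair: pair[0])[1]


dw = Warehouse()
dw.load(date(2008, 1, 1), "customer-42-segment", "silver")
dw.load(date(2008, 2, 1), "customer-42-segment", "gold")
```

Because the January row is never overwritten, `dw.as_of("customer-42-segment", date(2008, 1, 15))` still answers with the January value even after the February load.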

Primary Storage Management consists of the processes that manage data within and across the data warehouse and operational data store. It includes processes for backup and recovery, partitioning, summarization, aggregation, and archival and retrieval of data to and from alternative storage.
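Of the processes listed above, archival to alternative storage is the easiest to sketch. The example below moves rows older than a cutoff date out of primary storage; the row layout, the cutoff, and the use of plain Python lists for both storage tiers are assumptions for illustration only.

```python
from datetime import date


def archive_old_rows(primary, alternative, cutoff):
    """Move rows dated before `cutoff` from primary to alternative storage."""
    keep = []
    for row in primary:
        if row["date"] < cutoff:
            alternative.append(row)
        else:
            keep.append(row)
    primary[:] = keep  # update the primary store in place


primary = [
    {"date": date(2005, 6, 1), "amount": 10},
    {"date": date(2008, 1, 5), "amount": 20},
    {"date": date(2003, 3, 9), "amount": 30},
]
alternative = []
archive_old_rows(primary, alternative, cutoff=date(2007, 1, 1))
```

After the call, only the 2008 row remains in `primary`, while the older rows remain available in `alternative` for retrieval when needed.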

Alternative Storage is the set of devices used to cost-effectively store data warehouse and exploration warehouse data that is needed but not frequently accessed. These devices are less expensive than disks and still provide adequate performance when the data is needed.

Data Delivery is the set of processes that enable end users and their supporting IS group to build and manage views of the data warehouse within their data marts. It involves a three-step process consisting of filtering, formatting and delivering data from the data warehouse to the data marts.
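The three steps (filter, format, deliver) might look like the following minimal sketch. The row fields, the choice of region as the filter, and the per-product summary format are assumptions made for this example rather than anything the CIF text specifies.

```python
from collections import defaultdict


def deliver_to_mart(warehouse_rows, region):
    # Step 1: filter the warehouse rows down to one business unit.
    selected = [r for r in warehouse_rows if r["region"] == region]
    # Step 2: format the rows as a per-product summary.
    totals = defaultdict(float)
    for r in selected:
        totals[r["product"]] += r["amount"]
    # Step 3: deliver the result in the shape the data mart expects.
    return [{"product": p, "total_amount": t} for p, t in sorted(totals.items())]


warehouse_rows = [
    {"region": "EMEA", "product": "widget", "amount": 10.0},
    {"region": "EMEA", "product": "gadget", "amount": 5.0},
    {"region": "APAC", "product": "widget", "amount": 7.0},
    {"region": "EMEA", "product": "widget", "amount": 2.5},
]
emea_mart = deliver_to_mart(warehouse_rows, "EMEA")
```

The resulting `emea_mart` contains only EMEA data, already summarized, which matches the idea of a mart holding customized or summarized data for one business unit.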

The Data Mart is customized and/or summarized data derived from the data warehouse and tailored to support the specific analytical requirements of a business unit or function. It utilizes a common enterprise view of strategic data and provides business units more flexibility, control and responsibility. The data mart may or may not be on the same server or location as the data warehouse.

The Operational Data Store (ODS) is a subject-oriented, integrated, current, volatile collection of data used to support the tactical decision-making process for the enterprise. It is the central point of data integration for business management, delivering a common view of enterprise data.
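In contrast to the non-volatile warehouse, "current" and "volatile" mean that the ODS holds only the latest state and overwrites it on update. A minimal sketch, with a hypothetical `OperationalDataStore` class and sample key:

```python
class OperationalDataStore:
    """Holds only the current value per key; updates overwrite earlier state."""

    def __init__(self):
        self._current = {}

    def upsert(self, key, value):
        self._current[key] = value  # the previous value is discarded

    def get(self, key):
        return self._current.get(key)

    def __len__(self):
        return len(self._current)


ods = OperationalDataStore()
ods.upsert("order-1001-status", "received")
ods.upsert("order-1001-status", "shipped")
```

After the second `upsert`, the earlier status is gone, which suits tactical, up-to-the-minute decisions but not historical analysis.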

Meta Data Management is the process for managing information needed to promote data legibility, use and administration. Contents are described in terms of data about data, activity and knowledge.

The Exploration Warehouse is a DSS architectural structure whose purpose is to provide a safe haven for exploratory and ad hoc processing. An exploration warehouse utilizes data compression to provide fast response times with the ability to access the entire database.

The Data Mining Warehouse is an environment created so analysts may test their hypotheses, assertions and assumptions developed in the exploration warehouse. Specialized data mining tools containing intelligent agents are used to perform these tasks.

Activities are the events captured by the enterprise legacy and/or ERP systems as well as external transactions such as Internet interactions.

Statistical Applications are set up to perform complex, difficult statistical analyses such as exception, means, average and pattern analyses. The data warehouse is the source of data for these analyses. These applications analyze massive amounts of detailed data and require a reasonably performing environment.
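A toy version of a mean and exception analysis, using only Python's standard `statistics` module, is shown below. The z-score rule, the threshold of 2.0, and the sample figures are assumptions chosen for illustration; real statistical applications would use far larger data sets and more robust methods.

```python
from statistics import mean, pstdev


def find_exceptions(values, threshold=2.0):
    """Return values lying more than `threshold` population std devs from the mean."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return []  # all values identical, so nothing stands out
    return [v for v in values if abs(v - mu) / sigma > threshold]


daily_sales = [100, 102, 98, 101, 99, 100, 250]
exceptions = find_exceptions(daily_sales)
```

With this sample, only the 250 reading deviates enough from the mean to be flagged.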

Analytic Applications are pre-designed, ready-to-install, decision support applications. They generally require some customization to fit the specific requirements of the enterprise. The source of data is the data warehouse. Examples of these applications are risk analysis, database marketing (CRM) analyses, vertical industry "data marts in a box," etc.

External Data is any data outside the normal data collected through an enterprise's internal applications. There can be any number of sources of external data such as demographic, credit, competitor and financial information. Generally, external data is purchased by the enterprise from a vendor of such information.

Source: ITPUB Blog, link: http://blog.itpub.net/6517/viewspace-145519/. If you repost this article, please credit the source; otherwise legal action may be pursued.
