The warehouse ensures local consistency, global consistency, and referential integrity despite "dirty" sources and massive database size.įact-based management must not be slowed by the performance of the data warehouse RDBMS large, complex queries must be complete in seconds, not days.ĭata warehouse sizes are growing at astonishing rates. Many phases must be taken to load new or update data into the data warehouse, including data conversion, filtering, reformatting, indexing, and metadata update.įact-based management demands the highest data quality. Summarization algorithms, predefined queries, and reports business data, which include business terms and definitions, ownership information, etc.ĭata warehouses require increase loading of new data periodically basis within narrow time windows performance on the load process should be measured in hundreds of millions of rows and gigabytes per hour and must not artificially constrain the volume of data business.Information about the mapping from operational databases, which provides source RDBMSs and their contents, cleaning and transformation rules, etc.System performance data, which includes indices, used to improve data access and retrieval performance.Operational metadata, which usually describes the currency level of the stored data, i.e., active, archived or purged, and warehouse monitoring information, i.e., usage statistics, error reports, audit, etc. A description of the DW structure, including the warehouse schema, dimension, hierarchies, data mart locations, and contents, etc.It includes the following parameters and information for the middle and the top-tier applications: The metadata repository stores information that defines DW objects. The overall Data Warehouse Architecture is shown in fig: (2) A Multidimensional OLAP (MOLAP) model, i.e., a particular purpose server that directly implements multidimensional information and operations.Ī top-tier that contains front-end tools for displaying results provided by OLAP, as well as additional tools for data mining of the OLAP-generated data. (1) A Relational OLAP (ROLAP) model, i.e., an extended relational DBMS that maps functions on multidimensional data to standard relational operations. The OLAP server is implemented using either A gateway is provided by the underlying DBMS and allows customer programs to generate SQL code to be executed at a server.Įxamples of gateways contain ODBC (Open Database Connection) and OLE-DB (Open-Linking and Embedding for Databases), by Microsoft, and JDBC (Java Database Connection).Ī middle-tier which consists of an OLAP server for fast querying of the data warehouse. It may include several specialized data marts and a metadata repository.ĭata from operational databases and external sources (such as user profile data provided by external consultants) are extracted using application program interfaces called a gateway. Next → ← prev Three-Tier Data Warehouse Architectureĭata Warehouses usually have a three-level (tier) architecture that includes:Ī bottom-tier that consists of the Data Warehouse server, which is almost always an RDBMS.
0 Comments
Leave a Reply. |
Details
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |