Operational Database is the database-of-record consisting of system-specific reference data and event data belonging to a transaction-update system. It may also contain system control data such as indicators flags and counters. The operational database is the source of data for the data warehouse. It contains detailed data used to run the day-to-day...
An Operational Data Store ODS is an integrated database of operational data. Its sources include legacy systems and it contains current or near term data. An ODS may contain 30 to 60 days of information while a data warehouse typically contains years of data. An operational data store is basically a database that is used for being an interim area...
On-Line Analytical Processing On-Line Analytical Processing is a processing that supports the analysis of business trends and projections. It is also known as decision support processing and OLAP. An OLAP software enables companies to have real-time analysis of data stored in a database. An OLAP server is typically a separate component of an...
On-Line Transaction Processing On-Line Transaction Processing is a processing that supports the daily business operations. Also know as operational processing and OLTP. An OLTP is a database which must typically allow the real-time processing of SQL transactions to support traditional retail processes e-commerce and other time-critical applications....
In the broadest sense of the word aggregation means collecting and combining of data horizontally vertically and chronologically and then expressed in summary form to be used for statistical analysis. In the more technical sense aggregation is a special kind association that specified a part of whole relationship between the component part and the...
Automatic data partitioning is the process of breaking down large chunks of data and metadata at a specific data site into partitions according to the request specification of the client. Data sites contain multitudes of varied data which can be extremely useful as a statistical basis for determining many trends in businesses. Because data in the...
A cache is a type of dynamic and high speed memory that is used to supplement the function of the central processing unit and the physical disk storage. The cache acts as a buffer when the CPU tries to access data from the disk so the data traveling from the CPU and physical disks can have synchronized speed. Disk reading and writing process is generally...
In relational database management system RDBMS terminology Access Path refers to the path chosen by the system to retrieve data after a structured query language SQL request is executed. A query may request at least one variable to be filled up with one value or more. A query may look like this SELECT family_name FROM users WHERE family_name...
An Ad-Hoc Query is a query that cannot be determined prior to the moment the query is issued. It is created in order to get information when need arises and it consists of dynamically constructed SQL which is usually constructed by desktop-resident query tools. This is in contrast to any query which is predefine and performed routinely. The word Ad...
Data administration refers to the way in which data integrity is maintained within data warehouse. Data warehouses are very large repository of all sorts of data. These data maybe of different formats. To make these data useful to the company the database running the data warehouse has to be configured so that it obeys the business...
In Data Aggregation value is derived from the aggregation of two or more contributing data characteristics. Aggregation can be made from different data occurrences within the same data subject business transactions and a de-normalized database and between the real world and detailed data resource design within the common data...
Data Collection Frequency just as the name suggests refers to the time frequency at which data is collected at regular intervals. This often refers to whatever time of the day or the year in any given length of period. In a data warehouse the relational database management systems continually gather extract transform and load data onto the storage...
In any data resource it is essential to meet requirements of current as well as future demand for information. Data completeness assures that the above criterion is fulfilled. Data completeness refers to an indication of whether or not all the data necessary to meet the current and future business information demand are available in the data resource....
Data Compression is a method using which the storage space required for storing data is reduced with the help of mathematical techniques. Data compression is also referred to as source coding. This is the process of encoding data information using as few bits as possible compared to the unencoded data. As a real life non digital analogy the...
Data Concurrency ensures that both official data source and replicated data values are consistent that means whenever data values official data source is updated then the corresponding replicated data values must also be updated via synchronization in order to maintain consistency. In a single user database each transaction is processed serially...
Data Conversion as the name implies deals with changes required to move or convert data from one physical environment format to that of another like moving data from one electronic medium or database product onto another format. Every day data is being shared from one computer to another. This is a very common activity especially in data warehouses...
Data warehouse is implemented in an organisation with the help of data architecture schema. Elements which are specific to the company or organisation are defined in data architecture schema. For instance the administrative structure should be designed according to the real life undertakings of the company s administrative department so...
The Data Flow Diagram is commonly used also for the visualization of structured design data processing. The normal flow is represented graphically. A designer typically draws context level DFD first showing interaction between the system and the outside entities. Then this context level DFD will then be exploded in order to further show the details...
From a general information technology technical perspective a data dictionary is a set of metadata which contains the definition and representation of data elements. From the perspective of a database management system a data dictionary is a set of table and views which can only be read and never altered. When implementing a data warehouse which...
Data Dimension is mainly used in data warehouse implementations. A data warehouse is implemented to that organizations can profit from data driven operation which constitute a major component in running businesses these days. To be effective with a data driven operation data which is the basis for statistical results for trending should be accurate...
The best example of dissemination is the ubiquitous internet. Every single second throughout the year data gets disseminated to millions of users around the world. Data could sit on the millions of severs located in scattered geographical locations. Data dissemination on the internet is possible through many different kinds of communications protocols....
The definition of what constitutes a duplicate has somewhat different interpretations. For instance some define a duplicate as having the exact syntactic terms and sequence whether having formatting differences or not. In effect there are either no difference or only formatting differences and the contents of the data are exactly the same. In any...
Spatial Data is a kind of data that reflects the real world which has become too complex for the direct and immediate understanding of data consumers. Spatial Data are used to create models of reality and designed to have some similarity with selected aspects of the real world including status and nature of the reality. A Spatial Database is therefore...
A Data Warehouse is not just a rich repository of company data. It is also an overall strategy and process for making a cutting edge decision support system. One of the main objectives of a Data Warehouse it to bring together various information from several sources whose platforms could be totally different from one another but the Data Warehouse...
A database can be vast shared collection composed of data which are logically related to each other. Businesses rely heavily on data as they are Databases are used for managing the business day to day tasks so Data Collection happens every single day. Collection of data may seem a simple and trivial task. But databases have gone a long way from simply...
In a company a database contains millions of atomic data. Atomic data are data information that cannot be further broken down. For example product name is an atomic data because it can longer be broken down but product raw material can be broken further into raw components depending on the good. An individual products sales is another atomic data. But...
Change Data Capture refers to the process of capturing changes which are made to a production data source. Change Data Capture is typically performed by reading the source of database management software logs. Some of the features of Change Data Capture are It consolidates units of works Ensures that data is synchronized with the original...
Classic Data Warehouse Development is the process of building an enterprise business model creating a system data model defining and designing a data warehouse architecture constructing the physical database and lastly populating the warehouses database. In a real business environment the data warehouse is the main repository of the company s...
It is very important to document consumer profiles in the data warehouse. Consumer profile constitutes an essential component when the organization needs reports on the operating trends and patterns and how the organization is performing. If the organization is into a business competition having customer profiles in the database can give the answer...
Critical Success Factors are areas of activity in which favorable results are necessary for a company to reach its goal. Critical Success Factors are intensive used in business organizations as essential guides for the company or project to achieve its mission and goals. For example one of the Critical Success Factors for of a company involved in...
Crosstab or Cross Tabulation is a process or function that combines and or summarizes data from one or more sources into a concise format for analysis or reporting. Crosstabs display the joint distribution of two or more variables and they are usually represented in the form of a contingency table in a matrix. A Crosstab should never be mistaken...
Data access is the process of entering a database to store or retrieve data. Data Access Tools are end user oriented tools that allow users to build structured query language SQL queries by pointing and clicking on the list of table and fields in the data warehouse. Thorough computing history there have been different methods and languages already...
In simple but technical term metadata is a data that describes another data. It can be any item describing an individual datum or a collection of multiple content items. Metadata is very useful in facilitating the use management and understanding of data in a large data warehouse. Depending on the type of data and the context where the data is being...
Data mapping is a very important aspect in data integration. In fact it is the first step in the many complex tasks associated with data integration which include data transformation or data mediation between a data source and its destination; identification of relationships in data which is vital in analysis of data lineage; discovery of sensitive...
In any data warehouse implementation there are many different considerations which should in place before the final physical setting up. This is to avoid in problems related to quality of data and consistencies in data processes. A conceptual schema is an abstract definition of the whole project. In the case of data warehouse and business intelligence...
Computer networks are the main connectivity mechanism for passing data in an electronic environment. A network is composed of several computers connected by a wired or wireless medium so data and other resources can pass through for sharing. A computer network may be as small as two computers connected by wire or wireless medium to as big as millions...
Data Derivation refers to the process of creating a data value from one or more contributing data values through a data derivation algorithm. Almost all business organizations in today s environment are becoming more and more dependent on the data produced from the data warehouses and information systems in order to support the company s operations....
Data Partitioning is the formal process of determining which data subjects data occurrence groups and data characteristics are needed at each data site. It is an orderly process for allocating data to data sites that is done within the same common data architecture. Data Partitioning is also the process of logically and or physically partitioning...
Data Repository is a logical and sometimes physical partitioning of data where multiple databases which apply to specific applications or sets of applications reside. For example several databases revenues expenses which support financial applications A R A P could reside in a single financial Data Repository. A database warehouse is one...
Data Scheme is a diagrammatic representation of the structure of data. It represents any set of data that is being captured manipulated stored retrieved transmitted or displayed. A Data Scheme can be a complex diagram with all sorts of geometric figures illustrating data structure and data relationships to one another in the relational database...
A data store is very a very important aspect of a data warehouse in that it acts as support of the companies need for up-to-the-second operational integrated collective information. It is a place where data such as databases and flat files are saved and stored. Data stores are great feeders of data to the data warehouse. In a broad sense a data...
Data Thesaurus deals with understanding patterns trends and relationships in historical data and providing visual information to the decision maker. Data Thesaurus helps to identify common business terms and data names. It is useful for locating data in metadata warehouse. A data thesaurus really consists of several metadata....
Data Warehouse Engines handle storage quering and load mechanisms of large database. It is an undisputable fact that implementing a data warehouse is such a very challenging task. This becomes even more challenging and difficult to do when we take into consideration the diversity of both operational data sources and target data warehouse engines....
Data Warehouse Infrastructure basically supports a data warehousing enviroment with the help of a combination of technologies. In its most general definition a data warehouse is large repository of all sorts of data the implementing organization would in need in the present and in the future. But the real of data warehouse and its...
Data values are what actually take place in the data variable set aside by the data entities and all its attributes. It consists of facts and figures of data items data attribues and data characteristcs. From the data model whose structural part includes collection of data structures used in creating objects and entities modeled...
Database is a collection of data which are logically related. Database as used in computer science is well defined and structured collection of data which are stored digitally in computer system. They are designed to be easily stored and retrieved using database queries a set of computer codes translated into a language that the database system can...
In Decentralized Warehouse a central gateway provides access to remote data with the help of a logical view. This central gateway processes real time user queries. Users can access and also query the remote data via central gateway. A data warehouse is very large repository of a company s historical and current transactional...
In Decentralized Database a big database is partitioned as per business requirement in such a way that each smaller database represents a specific data subject. Today it is a fact that most business organizations from small to medium sized to large multinational corporations can hardly go into operation without having to rely on information....
End User Data can either be data provided by a data warehouse or the data created by end users for query processing. The technical world of computing and computers has always been divided into two general reams. On the one realm are the high priests the knowledgeable group of people who how the ins and outs of computers its most complex details....
Metadata are data about data; each metadata describes an individual data content item or a collection of data which includes multiple content items. Metadata Synchronization consolidates related data from different systems and synchronizes them for easier access. Metadata are very important components of any data warehouse implementation because...
Information consumers are everywhere are it has become of life that data and information have become driving forces in almost all aspects of our daily operations. With the ubiquity of the internet connection today s information consumers includes people of all ages and all walks of life and even non humans like artificial intelligence technologies...
A data warehouse is a repository of a business organization s historical data. It is a large part of an enterprise data management system which consists of several servers running on different kinds of platforms and database management systems. It is generally practiced that in an enterprise data management system it is the data warehouse house...
An enterprise data management system that consists of data stores and data warehouse may have several data sources. Primary Data Source is the first data site at which the original data is stored after their origination. Imagine the data warehouse whose database is the repository of all of the company s historical data. The data warehouse is the...
Also known as a primary keyword or a unique identifier a primary key is key used in a relational database which uniquely represents each record. It is a set of one or more data characteristics and its value uniquely identifies each data occurrence in a data subject. It can be any unique identifier in a database table s records such a driver s...