Software and metadata structures for distributed and interactive database architecture for parallel and asynchronous data processing of complex data and for real-time query processing
| Title: | Software and metadata structures for distributed and interactive database architecture for parallel and asynchronous data processing of complex data and for real-time query processing |
|---|---|
| Patent Number: | 9,454,578 |
| Publication Date: | September 27, 2016 |
| Appl. No: | 14/953365 |
| Application Filed: | November 29, 2015 |
| Abstract: | The various embodiments of the invention provide software and metadata structures for a data processing system and method, for applications such as marketing campaign management, speech recognition and signal processing. An exemplary system embodiment includes a first data repository adapted to store a plurality of entity and attribute data; a second data repository adapted to store a plurality of entity linkage data; a metadata data repository adapted to store a plurality of metadata modules, with a first metadata module having a plurality of selectable parameters, received through a control interface, and having a plurality of metadata linkages to a first subset of metadata modules; and a multidimensional data structure. The control interface may modify the plurality of selectable parameters in response to received control information. A plurality of processing nodes are adapted to use the plurality of selectable parameters to assemble a first plurality of data from the first and second data repositories and from input data, to reduce the first plurality of data to form a second plurality of data, and to aggregate and dimension the second plurality of data for storage in the multidimensional data structure. |
| Inventors: | Experian Marketing Solutions, Inc. (Schaumburg, IL, US) |
| Assignees: | Experian Marketing Solutions, Inc. (Schaumburg, IL, US) |
| Claim: | 1. A parallel and asynchronous computing system comprising: a data network; a plurality of data storage devices coupled to the data network, the plurality of data storage devices storing a plurality of data repositories having a plurality of data structures, storing a first program construct of a plurality of program constructs, the first program construct comprising a plurality of modifiable parameters adapted to be modifiable during data processing, the first program construct further defining a plurality of selectable input data sources and a plurality of selectable attributes; a second program construct of the plurality of program constructs, the second program construct having a corresponding metadata linkage to the first program construct, the second program construct defining a plurality of data structures for a plurality of data repositories of the computer system, the plurality of data structures comprising at least one relational data structure and at least one multidimensional data structure; a third program construct of the plurality of program constructs, the second program construct having corresponding metadata linkages to the first and second program constructs, the third program construct adapted to provide for execution of a plurality of data processes using the plurality of modifiable parameters, the plurality of input data sources and the plurality of selectable attributes, the third program construct further adapted to store a plurality of results of the plurality of data processes in the plurality of data structures; and a fourth program construct of the plurality of program constructs, the fourth program construct having corresponding metadata linkages to the first, second and third program constructs, the fourth program construct adapted to use the plurality of selectable attributes for a dimensioning and aggregating data for storage in the multidimensional data structure; and a plurality of computing devices coupled to the plurality of data storage devices through the data network, each computing device of the plurality of computing devices comprising one or more processors, a memory, and network interface coupled to the data network, one or more computing devices of the plurality of computing devices adapted to execute the plurality of data processes using the plurality of modifiable parameters, the plurality of input data sources and the plurality of selectable attributes, and to use the plurality of selectable attributes to a dimension and aggregate data for storage in the multidimensional data structure. |
| Claim: | 2. The computing system of claim 1 , wherein: a first subset of the plurality of data processes comprises at least two data assembly processes of the following group of data assembly processes: a data input process, a data layout process, a data conversion process, a data correction process, a data audit process, a data matching process, a linkage assignment process, a linkage selection process, a data coding process, a data enhancement process, a data change detection process, or a data update process; and a second subset of the plurality of data processes comprises at least one data reduction process of the following group of data reduction processes: a data netdown process, a data suppression process, or a data prioritization process. |
| Claim: | 3. The computing system of claim 2 , wherein the data assembly processes and data reduction processes correspondingly provide for data assembly and data reduction on the basis of the plurality of selectable attributes. |
| Claim: | 4. The computing system of claim 1 , wherein the one or more computing devices of the plurality of computing devices adapted to execute the plurality of data processes in parallel and asynchronously as a plurality of data processing nodes of the data processing system. |
| Claim: | 5. The computing system of claim 4 , wherein the third program construct is further adapted to provide for storage in a memory or in a database of a corresponding plurality of interim results from the asynchronous and parallel execution of the plurality of data processes. |
| Claim: | 6. The computing system of claim 5 , wherein the third program construct is further adapted to provide for the storage of the corresponding plurality of interim results as a corresponding set of unique identifiers. |
| Claim: | 7. The computing system of claim 1 , wherein the plurality of data storage devices further store a fifth program construct of the plurality of program constructs, the fifth program construct having a corresponding metadata linkage to the third program construct, the fifth program construct adapted to track each data process invocation of a plurality of data process invocations. |
| Claim: | 8. The computing system of claim 7 , wherein the plurality of data storage devices further store a sixth program construct of the plurality of program constructs, the sixth program construct having a corresponding metadata linkage to the fifth program construct, the sixth program construct adapted to associate a cost factor and a usage factor with each data process invocation of a plurality of data process invocations. |
| Claim: | 9. The computing system of claim 1 , wherein the plurality of data storage devices further store a seventh program construct of the plurality of program constructs, the seventh program construct adapted to determine a plurality of sets resulting from the dimensioned and aggregated data in the multidimensional data structure and to perform a plurality of set operations on the plurality of sets, the plurality of set operations comprising union operations, difference operations, and intersection operations. |
| Claim: | 10. The computing system of claim 1 , wherein the plurality of selectable input data sources comprises a plurality of databases and marketing lists containing a plurality of attributes, the plurality of attributes comprising at least two attribute types of the following group of attribute types: a name attribute, an address attribute, a demographic attribute, a behavioral attribute, or an attitudinal attribute. |
| Claim: | 11. The computing system of claim 1 , wherein the plurality of modifiable parameters further define a plurality of selectable data enhancements. |
| Claim: | 12. The computing system of claim 1 , wherein the fourth program construct is further adapted to use the multidimensional data structure to create a response to an input query. |
| Claim: | 13. The computing system of claim 1 , wherein the plurality of data storage devices further store an eighth program construct adapted to determine a plurality of available variations of at least one of the following: selectable data processes, selectable data attributes, or selectable data segmentations. |
| Claim: | 14. The computing system of claim 1 , wherein the plurality of data storage devices further store a ninth program construct adapted to provide data modeling and data analysis using the plurality of modifiable parameters, the plurality of selectable input data sources, and the plurality of selectable attributes. |
| Claim: | 15. A computing system, comprising: a data network; a plurality of data storage devices coupled to the data network, the plurality of data storage devices storing a first program module defining a plurality of modifiable parameters adapted to be modifiable during data processing and a plurality of selectable attributes; a second program module defining a plurality of selectable input data sources, the plurality of selectable input data sources comprising at least one consumer database and a plurality of consumer lists of individuals and households; and a third program module defining a plurality of data structures for a plurality of data repositories of the computer system, the plurality of data structures comprising a first relational data structure, a second relational data structure storing a subset of information from the first relational data structure, and at least one multidimensional data structure; and a plurality of computing devices coupled to the plurality of data storage devices through the data network, each computing device of the plurality of computing devices comprising one or more processors, a memory, and network interface coupled to the data network, one or more computing devices of the plurality of computing devices adapted to execute a plurality of marketing campaign management data processes using the plurality of modifiable parameters, the plurality of selectable input data sources and the plurality of selectable attributes; to perform a first plurality of data assembly processes and store assembled data in the first relational data structure, to perform a second plurality of data reduction processes on the assembled data and store reduced data in the second relational data structure, to use the plurality of selectable attributes for dimensioning and aggregation of the reduced data for storage in the multidimensional data structure; to correlate invocation of the plurality of marketing campaign management data processes with accounting functions and resource utilization; and to track selection of the plurality of consumer lists. |
| Claim: | 16. A computing system, comprising: a data network; a plurality of data storage devices coupled to the data network, the plurality of data storage devices storing a first metadata module of a plurality of metadata modules, the first metadata module comprising a plurality of modifiable parameters adapted to be modifiable during data processing; a second metadata module of the plurality of metadata modules, the second metadata module defining a plurality of selectable input data sources comprising at least one consumer database and a plurality of consumer lists; a third metadata module of the plurality of metadata modules, the third metadata module defining a plurality of data structures for a plurality of data repositories of the data processing system, the plurality of data structures comprising a first relational data structure storing a first subset of information, a second relational data structure storing a second subset of information from the first relational data structure, and at least one multidimensional data structure storing a third subset of information from the second relational data structure which has been dimensioned and aggregated; a plurality of computing devices coupled to the plurality of data storage devices through the data network, each computing device of the plurality of computing devices comprising one or more processors, a memory, and network interface coupled to the data network, one or more computing devices of the plurality of computing devices adapted to execute a plurality of data assembly processes using the plurality of selectable input data sources and the plurality of modifiable parameters to form the first subset of information; to execute a plurality of data reduction processes using plurality of modifiable parameters to form the second subset of information; to dimension and aggregate the second subset of information to form the third subset of information, using a plurality of selectable attributes; to perform a plurality of set operation on the third subset of information using the plurality of selectable attributes; to utilize the multidimensional data structure to process a query and to generate a response to the query; and to provide at least one selection recommendation for the plurality of modifiable parameters, the plurality of selectable input data sources, or the plurality of selectable attributes. |
| Claim: | 17. The computing system of claim 16 , wherein the plurality of data storage devices further store a metadata module, of the plurality of metadata modules, adapted to track each invocation of a data process of the plurality of data assembly processes and the plurality of data reduction processes, and to associate each data process invocation with an accounting function and with a resource usage function. |
| Claim: | 18. The computing system of claim 16 , wherein: the plurality of data assembly processes comprises at least two data assembly processes of the following group of data assembly processes: a data input process, a data layout process, a data conversion process, a data correction process, a data audit process, a data matching process, a linkage assignment process, a linkage selection process, a data coding process, a data enhancement process, a data change detection process, or a data update process; and the plurality of data reduction processes comprises at least one data reduction process of the following group of data reduction processes: a data netdown process, a data suppression process, or a data prioritization process. |
| Claim: | 19. The computing system of claim 16 , wherein the plurality of selectable attributes comprises at least two attribute types of the following group of attribute types: a name attribute, an address attribute, a demographic attribute, a behavioral attribute, or an attitudinal attribute. |
| Patent References Cited: | 2006/0271568 November 2006 Balkir |
| Primary Examiner: | Woo, Isaac M |
| Attorney, Agent or Firm: | Gamburd, Nancy R.; Gamburd Law Group LLC |
| Accession Number: | edspgr.09454578 |
| Database: | USPTO Patent Grants |
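
The abstract and claims describe a staged flow: assemble data from selectable input sources, reduce it (netdown, suppression), dimension and aggregate it into a multidimensional structure, then answer queries with set operations. The following is a minimal single-process sketch of that flow, not the patented implementation. Every name in it (`params`, `sources`, `link_id`, the function names, and the sample records) is a hypothetical stand-in chosen for illustration, and the parallel, asynchronous execution across processing nodes is deliberately left out.

```python
from collections import defaultdict

# Hypothetical stand-ins for the "modifiable parameters" and "selectable attributes".
params = {"suppress_states": {"NY"}, "dimensions": ("state", "age_band")}

# Hypothetical stand-ins for two "selectable input data sources" (consumer lists).
sources = [
    [{"name": "Ann Lee", "state": "IL", "age_band": "30-39"},
     {"name": "Bob Ray", "state": "NY", "age_band": "40-49"}],
    [{"name": "ann lee", "state": "IL", "age_band": "30-39"},
     {"name": "Cy Day", "state": "IL", "age_band": "40-49"}],
]

def assemble(sources):
    """Data assembly: read every source and assign a linkage key to each record."""
    records = []
    for source in sources:
        for row in source:
            # Assumption: a normalized name is a good enough linkage key for a sketch.
            records.append({**row, "link_id": row["name"].strip().lower()})
    return records

def reduce_records(records, params):
    """Data reduction: net down duplicates by link_id, then apply suppression."""
    first_seen = {}
    for rec in records:
        first_seen.setdefault(rec["link_id"], rec)  # keep the first record per key
    return [r for r in first_seen.values()
            if r["state"] not in params["suppress_states"]]

def dimension_and_aggregate(records, params):
    """Build a tiny cube: one cell per combination of dimension values,
    holding the set of unique identifiers that fall in that cell."""
    cube = defaultdict(set)
    for rec in records:
        cell = tuple(rec[d] for d in params["dimensions"])
        cube[cell].add(rec["link_id"])
    return dict(cube)

def select(cube, index, value):
    """Return the identifiers in every cell whose dimension `index` equals `value`."""
    ids = set()
    for cell, members in cube.items():
        if cell[index] == value:
            ids |= members
    return ids

cube = dimension_and_aggregate(reduce_records(assemble(sources), params), params)
in_il = select(cube, 0, "IL")       # everyone in the "IL" slice
in_40s = select(cube, 1, "40-49")   # everyone in the "40-49" slice

print(sorted(in_il & in_40s))  # intersection -> ['cy day']
print(sorted(in_il - in_40s))  # difference   -> ['ann lee']
print(sorted(in_il | in_40s))  # union        -> ['ann lee', 'cy day']
```

Storing sets of identifiers in each cell, rather than full records, loosely mirrors claim 6 (interim results kept as unique identifiers) and makes the union, difference, and intersection operations of claim 9 direct set operations.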