Grouping allocates samples into several groups according to a specific flag. Groups differ from one another, while the members of each group share a relative commonness. Grouping plays an important role in statistical analysis. For example, type grouping is used to differentiate the types of economy, soci...
2014-05-19 775 0
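As a rough illustration of the grouping idea in the excerpt above, here is a minimal Python sketch that allocates samples into groups by a flag. The function name `group_by_flag` and the sample records are hypothetical, invented only for this example; they do not come from the original post.

```python
from collections import defaultdict


def group_by_flag(samples, flag):
    """Allocate samples into groups keyed by the value of flag(sample)."""
    groups = defaultdict(list)
    for sample in samples:
        groups[flag(sample)].append(sample)
    return dict(groups)


# Hypothetical samples: (company, economy type) pairs made up for illustration.
samples = [("A", "state-owned"), ("B", "private"), ("C", "state-owned")]

# Type grouping: the flag is the economy type stored in the second field.
by_type = group_by_flag(samples, flag=lambda s: s[1])
```

Members of the same group share the flag value, and different groups differ in it, which is the property the excerpt describes.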
Hadoop is an outstanding distributed computing system whose default development mode is MapReduce coding. However, MapReduce is not specifically designed for data computing. In addition, its syntax is cumbersome, its coding efficiency for computation is relatively low, and it is even more difficult to compose the u...
2014-05-15 805 0
Description of the Issue
Some reports in a project ran very slowly. Despite various optimizations to iReport and the Oracle database, performance was still unsatisfactory. For example, one detail report involves a large data volume, dozens of data tables, and frequent inter-table joins (including s...
2014-05-14 745 0
The low efficiency of Hadoop computation is undeniable. We believe one of the major reasons is that the underlying computational structure of MapReduce in Hadoop is essentially external memory computation. External memory computation implements data exchange through the frequent external memory...
2014-05-13 784 0
Recently, a development team ran into difficulties with data source computation when developing iReport reports. After they used esProc for cross-database computation, the problem was resolved.
This is a project payment progress report, part of a project management system. It’s based on an Oracle database. The repor...
2014-05-12 671 0
The More Convenient Data Computing Scripts I
III Ordered Set
Case Description
Suppose that a telecommunication equipment manufacturer needs to compute the monthly link relative ratio of sales value (i.e. the percentage increase in each month's sales value over that of the previous month). The sales data is stor...
2014-05-09 759 0
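The link relative ratio described in the case above is simple arithmetic: for each month, (current sales - previous sales) / previous sales, expressed as a percentage. A minimal Python sketch follows, assuming the sales data is already available as a list of (month, sales value) pairs ordered by month. The excerpt is cut off before it says how the data is stored, so the input shape, the function name, and the figures are all assumptions for illustration.

```python
def link_relative_ratios(monthly_sales):
    """Return (month, percent change vs. previous month) for each month after the first.

    monthly_sales: list of (month, sales_value) pairs, already ordered by month.
    A previous-month value of 0 yields None, since the ratio is undefined.
    """
    ratios = []
    for (_, prev), (month, cur) in zip(monthly_sales, monthly_sales[1:]):
        ratio = None if prev == 0 else (cur - prev) * 100 / prev
        ratios.append((month, ratio))
    return ratios


# Hypothetical figures, invented for this example only.
sales = [("2014-01", 200.0), ("2014-02", 250.0), ("2014-03", 225.0)]
result = link_relative_ratios(sales)
```

The computation depends on the order of the rows, since each month is compared with the one just before it, which is why the case sits under the "Ordered Set" heading.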
Databases play an irreplaceable role in the modern economy and are widely used in business computing areas such as Enterprise Resource Planning (ERP), Customer Relationship Management (CRM), Supply Chain Management (SCM), and Decision Support Systems (DSS).
Computation on the structured data in the database mainly reli...
2014-05-08 775 0
A big data real-time application is a scenario in which computation and analysis results are returned in real time even when the amount of data is huge. This demand on database applications has emerged in recent years.
In the past, because there was not much data, computation was simple, and there was little parallelism, the p...
2014-05-07 784 0
As we know, SQL users have to finish writing all the code and then run it all at once, which makes interactive computing inconvenient. However, programmers have always welcomed SQL's simple, easy-to-understand query syntax. The R language and esProc, as computing and analysis tools, are surely required to o...
2014-05-06 715 0
In real business, quite a lot of report data comes from files instead of databases. For example: computing over attendance data files to prepare the salary payable report, over log files to prepare the user behavior analysis report, and over stock trading record files to prepare the stock anal...
2014-05-05 848 0
Data becomes valuable only when it is involved in computing and analytics, and big data is no exception. The computational capability on structured big data determines the range of practical applications of big data. In this article, I’d like to introduce several computation methods: API, Script, SQL, a...
2014-05-04 786 0
Description of the Issue
Some reports in a project ran very slowly. Despite various optimizations to iReport and the Oracle database, performance was still unsatisfactory. For example, one detail report involves a large data volume, dozens of data tables, and frequent inter-table joins (including se...
2014-04-24 760 0
What is IOE? I=IBM, O=Oracle, and E=EMC. Together they represent the typical high-end database and data warehouse architecture. The high-end servers include HP, IBM, and Fujitsu; the high-end database software includes Teradata, Oracle, and Greenplum; the high-end storage includes EMC, Violin, and Fusion-io.
In the past, such typic...
2014-04-23 928 0
A stored procedure is a collection of SQL queries and flow control statements. Stored procedures are usually used to break a complex computation task down into several simple computation steps. Although a stored procedure can improve flexibility at the cost of complexity, it is always hindered by the inherent dra...
2014-04-22 750 0
Computing in business activities involves enterprise reporting (Reporting), business data integration and cleaning (Data Integration and ETL), OLAP (Online Analytical Processing), ERP (Enterprise Resource Planning), CRM (Customer Relationship Management), SCM (Supply Chain Management), and DSS (Decision Support System)....
2014-04-21 817 0