This is a requirement for one of our clients. They have two sources of data. Once is a MySql data, which is in an unstructured format, and another Excel data, in a structured format. I need to extract both, join them together and then load cubes.
So, what I did is as follows:
1. Extracted the MySql data and the Excel data
2. Denormalized the MySql extract so that the unstructured data is transformed into structured data
3. Joined the two tables - Denormalized MySQL extract and the Excel extract
4. Used the joined table to load my cube
However, the entire process is taking a lot of time, and it sometime takes a few hours to complete the cube load. The MySql data has about 400,000 rows and Excel about 60,000 rows.
Please suggest a better way to do the process. I am meeting the client tomorrow, and any quick help is really appreciated.
Thank you,
Kiran
So, what I did is as follows:
1. Extracted the MySql data and the Excel data
2. Denormalized the MySql extract so that the unstructured data is transformed into structured data
3. Joined the two tables - Denormalized MySQL extract and the Excel extract
4. Used the joined table to load my cube
However, the entire process is taking a lot of time, and it sometime takes a few hours to complete the cube load. The MySql data has about 400,000 rows and Excel about 60,000 rows.
Please suggest a better way to do the process. I am meeting the client tomorrow, and any quick help is really appreciated.
Thank you,
Kiran