The Squirrels and the Chestnuts

These days I am trying out some of the problems Codechef has put up for its competitions. I will be posting the algorithms I used, both for anyone who's interested and for comments and suggestions for improvement.
Problem (Taken from the June 2010 competition at Codechef)
There are n squirrel(s) waiting below the feet of m chestnut tree(s). The first chestnut of the i-th tree will fall right after Ti second(s), and one more every Pi second(s) after that. The "big mama" of the squirrels wants them to bring to their nest no less than k chestnuts, to escape the coming storm, as fast as possible! So they are discussing below which trees to wait in order to gather enough chestnuts in the shortest time. The time to move to the positions is zero, and the squirrels move nowhere after that.
Request
Calculate the shortest time (how many seconds more) in which the squirrels can gather enough chestnuts.
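To make that concrete: with one squirrel and one tree where T1 = 2 and P1 = 3, chestnuts fall at seconds 2, 5, 8 and so on, so gathering k = 3 chestnuts takes 8 seconds.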

Algorithm

FindLeastTime (integer numberOfSquirrels, integer numberOfChestnuts, integer numberOfTrees, array startTimes, array intervals)
{
    /*
    startTimes - contains the time at which the first chestnut falls from each tree
    intervals - contains the time interval between consecutive chestnuts from each tree
    */
    time = 0;
    count = 0;        // chestnuts the squirrels could have collected so far
    array multiplier; // for each tree, the number of intervals elapsed so far
    array tally;      // chestnuts fallen from each tree at the current time
    while (count < numberOfChestnuts)
    {
        for (each tree i)
        {
            if (startTimes[i] + multiplier[i] * intervals[i] <= time)
            {
                tally[i] = tally[i] + 1;
                multiplier[i] = multiplier[i] + 1;
            }
        }
        tempArray = Sort(tally); // ascending order
        count = 0;
        for (j = 1; j <= numberOfSquirrels; j = j + 1)
        {
            count = count + tempArray[j-th index from the end]; // sum the j largest tallies
        }
        if (count < numberOfChestnuts)
        {
            time = time + 1;
        }
    }
    return time;
}
Try implementing the algorithm in a language of your preference. I tried out C# and Java. C# was somewhat faster in execution.
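For anyone who wants a starting point, below is a minimal C# sketch of the pseudocode above. The method and parameter names are my own, the tree count is taken from the array lengths, and it keeps the same brute-force second-by-second search, so treat it as a sketch rather than an optimized solution.

using System;
using System.Linq;

class Chestnuts
{
    // Earliest time at which the squirrels can have collected at least
    // chestnutsNeeded chestnuts, following the pseudocode above.
    static int FindLeastTime(int squirrels, int chestnutsNeeded,
                             int[] startTimes, int[] intervals)
    {
        int trees = startTimes.Length;
        int[] multiplier = new int[trees]; // intervals elapsed per tree
        int[] tally = new int[trees];      // chestnuts fallen per tree
        int time = 0;

        while (true)
        {
            for (int i = 0; i < trees; i++)
            {
                if (startTimes[i] + multiplier[i] * intervals[i] <= time)
                {
                    tally[i]++;
                    multiplier[i]++;
                }
            }
            // Each squirrel waits under one tree, so the best they can do
            // by now is the sum of the 'squirrels' largest tallies.
            int count = tally.OrderByDescending(c => c).Take(squirrels).Sum();
            if (count >= chestnutsNeeded)
                return time;
            time++;
        }
    }

    static void Main()
    {
        // One squirrel, one tree: first chestnut at t = 2, then every 3 s,
        // so chestnuts fall at 2, 5, 8 and k = 3 takes 8 seconds.
        Console.WriteLine(FindLeastTime(1, 3, new[] { 2 }, new[] { 3 }));
    }
}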

A small problem encountered in Creating MySQL stored procedures

The other day I started working on a small personal project which involved a MySQL database. I'm pretty familiar with MS SQL, and MySQL is almost the same as far as the language goes, so I got right down to creating the database and the required stored procedures. This is where I hit a little snag. Maybe not so little, considering the time it took me to figure it out 🙂

The code I wrote for the stored procedure is given below.

CREATE PROCEDURE test()
BEGIN
SELECT * FROM entries;
END

The query is as simple as can be and worked perfectly when I ran it in the phpMyAdmin SQL query window. But whenever I tried to create the stored procedure I kept getting error code 1064, which basically indicates a syntax error. After a lot of reading on creating stored procedures in MySQL I was able to confirm that my syntax was in fact correct. So what was the problem?

This is what I finally found out from a discussion over at Stack Overflow. MySQL considers ; to be the statement delimiter by default. The entire CREATE PROCEDURE block is also considered a statement to which the delimiter applies. So when the parser reaches the semicolon in my code, it treats it as the delimiter ending the entire CREATE PROCEDURE statement. But the END corresponding to the BEGIN lies beyond that semicolon, which throws the error I was getting.

All I had to do was change the delimiter to // using the delimiter option text box phpMyAdmin provides right under the query editing area. This way a // is automatically added to the end of the entire block, and the semicolon works as a delimiter for the simple statements inside it.

Those of you who are using a MySQL client that is command line based will need to wrap your code as shown below (note the // after END, which is what actually terminates and executes the CREATE PROCEDURE statement):

delimiter //

CREATE PROCEDURE test()
BEGIN
SELECT * FROM entries;
END//

delimiter ;

This will change the delimiter back to the default once your code has been executed.

Using an XML Configuration file and Expressions in an SSIS package

Introduction

In my previous posts I've given an introduction to developing simple ETL packages using Microsoft Integration Services. In this post I'll show how you can make your ETL packages more dynamic using expressions evaluated at run time, as well as how to set properties of a package at run time using an XML configuration file.

The use of expressions allows a developer to create an ETL package where settings such as database connection strings, output file names and file locations are acquired at run time, while the use of an XML configuration file makes it easy to change common settings across multiple ETL packages.

I'll be demonstrating these features using the simple ETL package developed in my previous posts, to which I will add an XML configuration file that sets the input file location. (This is usually not the kind of value set by a configuration file, since the source file location changes from ETL to ETL; a better candidate would be the connection string of a data source shared by multiple ETLs.) Then I will provide the file location as an expression to the file connections used in the ETL, so that the actual file location is acquired by the connections at run time.

Step 1:

Create a String variable with global scope to hold the path of the input data file.


The variable created to hold the file path

Step 2:

Next, right click on the control flow design area and select the Package Configurations option. Then, from the screen that appears, tick the check box enabling package configurations and click the Add button to add a configuration.

The following screen will then pop up.

Adding an XML Configuration file

From the configuration type select XML configuration file (note: there are a number of possible configuration types, including parent package variables, which will be introduced in a future post).

Then select the "Specify configuration settings directly" option, and for the configuration file name give a suitable name at a location of your choice using "Browse".

Setting the configuration file name

The next step in the wizard allows you to select the properties of the package whose values will be set by the configuration file. In this case I've given only the value of the InputFilePath variable as the property to be set by the XML configuration file, as seen in the screenshot below.

adding properties to be set using the XML file

Now you can continue, and in the next step of the wizard set the name for the configuration as seen by the project. The next time you access package configurations, this configuration will appear in the list under the given name.

Step 3:

Let's take a look at the configuration file we've created. Browse to the file's location on the file system and open it in a text editor.

The XML configuration file

Each property set by the configuration file has its information within a <Configuration></Configuration> tag pair. Notice the configuration tags for the InputFilePath variable, and note that the property being set is identified by the Path attribute within the Configuration tag.

The value for the property should be set within the <ConfiguredValue></ConfiguredValue> tags inside the relevant Configuration tags, as seen in the screenshot below.

Setting the value within the XML Configuration file
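If you prefer to see it as text, a configuration file that sets the InputFilePath variable would look roughly like the sketch below. The Path follows the pattern the wizard generates for a variable's Value property, and the actual file path is just an illustrative value.

<DTSConfiguration>
  <Configuration ConfiguredType="Property" Path="\Package.Variables[User::InputFilePath].Properties[Value]" ValueType="String">
    <ConfiguredValue>C:\Data\Input.xlsx</ConfiguredValue>
  </Configuration>
</DTSConfiguration>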

The XML configuration file has now been modified to provide the location of the input file to the package at run time. Let's move back to the SSIS project.

Step 4:

Now that we have set up the XML configuration file to assign the value of InputFilePath at run time, we can use this variable to dynamically provide the input file path to the Excel File Connection Manager and the RowSet Schema Connection Manager.

Select the Excel File Connection Manager that we've created from the connection managers area. Then, from the properties pane that appears on the right, select the "Expressions" option.

The Expressions option in the connection manager

Note that this Expressions option is available on most SSIS components, allowing their properties to be set dynamically.

The screen that appears when you select "Expressions" allows you to select different properties of the Excel Connection Manager and construct the corresponding expression. In this case I've selected the Excel File Path property and set the expression to the InputFilePath variable as shown.
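In case you want to type it rather than drag and drop, the expression here is nothing more than the variable reference, which the SSIS expression language writes as @[User::InputFilePath].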

Selecting the Excel File Path property

Constructing the expression using Expression Builder

You can simply drag and drop variables from the variable list into the expression area, as well as type in constants. The expression builder also lets you use a number of string, math, logic and other operations, which can likewise be dragged and dropped.

Once the expression is set the Expression screen for the Excel File Connection Manager should look like this.

Excel File Path with the Expression set

Now select the Excel RowSet Schema Connection Manager that we use to iterate through the sheets of the input Excel file, and set its "Server Name" property to the same expression as the one used for the Excel File Path.

Conclusion

Now the SSIS package is configured so that at run time the XML configuration file sets the value of the InputFilePath variable, and that path is assigned to the Excel file source through expressions. Normally the use of expressions and a configuration file is unnecessary for a package as simple as this one, with just one variable used in expressions for two connections. However, when a value is used in multiple components in the same package, or across multiple packages, expressions and a configuration file not only standardize the packages but also significantly reduce the work needed to change a value used by many components or packages (it only needs to be modified once, in the configuration file).

Run the completed package and observe how the added configuration works.

Enjoy! :)

Extracting data from multiple sheets in an Excel file in SSIS

Introduction

In my previous post I showed how we could develop a simple ETL package using Microsoft SQL Server Integration Services to transfer the data in a single sheet of an Excel file to a table in a database.

But what if you had multiple sheets of data in your Excel file? This post is meant to show you how to tackle that problem.

What I will touch on in this post

  • Use of package variables in SSIS
  • Use of the foreach control flow component
  • Use of a script task
  • Possible errors

Step 1:

The ETL package we completed in the previous post can be extended to demonstrate how multiple sheets from an Excel file can be processed.

As a first step I've extended the data in the Excel file as shown. Note that the data in the first sheet has already been inserted into the database by the ETL from the previous post. I will use this fact to demonstrate the function of the Slowly Changing Dimension component used in the ETL.

Sheet1

Sheet 2

In order to tackle multiple sheets in an Excel file we need to use SSIS package variables, both to maintain state and to test certain conditions. In this example I've created two package variables with global scope and the data types shown below. The need for these variables will become clear as we go on.

package variables

Step 2:

The key component we need to use to achieve our goal is the  “foreach” loop container. We can use this to iterate through the sheets in our target Excel file.

Drag and drop a "foreach" loop container from the toolbox, and then drag and drop the "Load to users" data flow task into the loop container. Next, right click on the container and select the Edit option to configure it.

In the configuration interface, go to the Collection tab and configure the settings as shown below.

Loop Container properties

Notice that for the Connection property I have given the location of the Excel file. When you create a new connection you will be directed to a form where you have to set the connection properties. Set these properties as shown below.

Connection Properties 1

Connection Properties 2

The value “Excel 12.0” for the Extended properties is specific to Excel 2007  (.xlsx) files. If you are dealing with an Excel 97-2003 file you need to use “Excel 8.0”.
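(For reference, the connection these settings produce is an OLE DB connection string along the lines of Provider=Microsoft.ACE.OLEDB.12.0;Data Source=C:\Data\Input.xlsx;Extended Properties="Excel 12.0"; for Excel 97-2003 files the provider is Microsoft.Jet.OLEDB.4.0 with Extended Properties "Excel 8.0". The exact path here is illustrative.)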

In the case of Excel 97-2003 files I still haven't got the looping through sheets to work. For some reason I get an error at the Excel source component in the data flow task that basically says the file is being held by another component. (I've surmised this is the loop container, since it's the only other component that connects to the file, though why this happens only for Excel 97-2003 files I have no idea yet.)

Once you OK the connection properties, a connection manager will be created for the loop container to use to connect to the Excel file, and it will be visible in the connection managers area.

Back at the configuration screen of the loop container, move to the Variable Mappings tab and set the properties as shown.

Getting the sheet names

What we do here is extract the sheet name as the loop container iterates through each sheet of the Excel file. The sheet name is stored in the ActiveWorkSheet user variable we created at the very beginning.

Now you have completed the configuration of the loop container, so you can click OK and close the configuration screen.

Step 3:

The problem with this approach is that, for some reason, in some iterations the variable gets set to bogus values that will cause errors if used as sheet names. Therefore we need to validate that the value of the variable is an actual sheet name before we use it. For this purpose I have used a Script Task on the control flow. The Script Task allows you to write code in C#.

Drag and drop a Script Task into the loop container, above the data flow task. Then right click it and select Edit to configure the task. In the configuration screen, set the properties as follows.

setting read write variables for the script task

In the Script Task I will read the value of ActiveWorkSheet and, based on a validity test (a regular expression matching default sheet names), set the value of IsValidSheet to true or false as appropriate (which is why IsValidSheet is taken as a read/write variable).

Now enter the code area by selecting Edit Script, and add the following code within the Main method of the script.

Code to validate the sheet name
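Since the screenshot may be hard to read, here is a rough sketch of the kind of code the Main method contains. The variable names are the ones created earlier; the regular expression is an assumption that only default sheet names such as Sheet1$ should be accepted.

public void Main()
{
    // Read the sheet name the loop container assigned to the variable.
    string sheetName = Dts.Variables["User::ActiveWorkSheet"].Value.ToString();

    // Accept only values matching the default sheet name pattern, e.g. "Sheet1$".
    bool isValid = System.Text.RegularExpressions.Regex.IsMatch(
        sheetName, @"^Sheet\d+\$$");

    // IsValidSheet is a read/write variable, so we can assign the result to it.
    Dts.Variables["User::IsValidSheet"].Value = isValid;

    Dts.TaskResult = (int)ScriptResults.Success;
}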

Notice the way in which the package variables are accessed and assigned to in the script. Build  the script and exit to the control flow area.

Step 4:

Now connect the validation script to the data flow task by dragging the outgoing arrow of the Script Task onto the data flow task. Then right click on the connection that is made and select Edit.

In the configuration screen that appears, set up the properties as follows.

Applying a condition to the control flow

Choosing "Expression and Constraint" for the evaluation operation ensures that the control flow proceeds beyond the Script Task only if the task completes successfully (as specified by Value = Success) and the IsValidSheet variable is true (as specified by the expression).
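(Assuming IsValidSheet was created as a Boolean variable, the expression is simply @[User::IsValidSheet], or equivalently @[User::IsValidSheet] == true.)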

Press OK and you should have a control flow similar to the one below.

Completed control flow

The fx symbol indicates that a condition has been applied to the flow from the script task to the data flow task.

Step 5:

Now double click the data flow task and move into the data flow design area. There, change the Excel source properties as follows.

Modifying the Excel Source to iterate through the sheets

Notice that the data access mode is now set to a variable type, and the variable given is the one to which the loop container assigns the sheet name. In order to preview, you may want to initialize the ActiveWorkSheet variable to a known sheet name (e.g. Sheet1$) in the variables pane. This will automatically get overwritten at run time.

Step 6:

The package is now complete, and you can execute it and see that the new data is entered into the database. Notice that in the first iteration of the loop the data flow doesn't proceed beyond the Slowly Changing Dimension, because exactly the same rows are already present in the database. In contrast, when iterating through the second sheet, which contains the new rows, you will see the data flow proceeding to the insert component, as below.

Completed Data Flow task

Note: if the Excel file you are using has empty sheets that don't even have the column headers, the package will give a synchronization error at run time. I am currently looking for a solution to this issue and will post when I find one (in the meantime you can remove empty sheets, or at least have the column headers in them).

Building a simple ETL package using MS SQL Integration Services

Introduction

During the last couple of months I've been in industrial training at a company that is developing a business intelligence application for a major bank in the country using Microsoft SQL Server and .NET. A significant part of the development involves the creation of ETL (Extract, Transform, Load) packages to transfer the data in the Excel files generated every month by the bank to the application database. This is carried out by developing ETL packages using the Integration Services component of the MS SQL Server suite, which provides a powerful and intuitive interface for development.

So this post is meant to be a very basic introduction to ETL package development in MS SQL Server Integration Services.

What you need for this to go smoothly

  • Enough familiarity with MS SQL Server to create your own databases and tables and query them
  • Familiarity with a Microsoft Visual Studio version
  • Access to both MS SQL Server and Microsoft Visual Studio including the business intelligence suite

Step 1:

Create the Database for our project with one table having data fields as shown by the image.

The Destination Table in our database

In my case I've created a database BlogDatabase and added the above table (note: the id is the auto-increment primary key of the table).

Step 2:

Start Visual Studio and create a new Integration Services project.

Once you have created the project, add an SSIS package by right clicking the SSIS Packages folder in the solution explorer and selecting the New Package option from the menu.

Once a new package is added you will notice that the design interface for the package consists of five tabs, as follows:

  • Control flow
  • Data Flow
  • Event Handlers
  • Package Explorer
  • Execution Results

Out of these, we only need to be concerned with the Control Flow and Data Flow tabs in this post.

The control flow design area gives us the ability to design the logical flow of the ETL operation, which may include testing conditions, looping and so on.

The data flow design area is accessed through a data flow task that we insert into the control flow design. A data flow task at the control flow level simply encompasses and represents a series of data manipulation operations that are added and managed in the data flow design area.

Step 3:

One of the first things we should do is establish our BlogDatabase as a shared data source for the project. While connections to the database can be made from each new package by directly referring to the connection string, it is good practice to have the database as a shared data source for the solution when it is used by multiple packages.

To create a shared data source out of the BlogDatabase, right click the Data Sources folder in the solution explorer and select the New Data Source option.

From the wizard that appears, select the 'New' option and then specify the server and database names.

Note: make sure to test the connection using the "Test Connection" button available on the form where you specify the server and the database.

Creating a shared data source

You can name the shared data source as you see fit in the final stage of the wizard, and once completed the data source will appear under the Data Sources folder in the solution explorer.

Once this is done you need to add a connection to the data source to your package. Right click on the connection managers area at the bottom of the UI and select the option "New Connection from Data Source". In the form that appears you can select the newly created shared data source, and it will be shown as added to the package in the connection managers area.

Step 4:

Now that you’ve set up the data source you are ready for the real work.

Take a look at the toolbox at the right end of the UI (it may need to be expanded), and from the Control Flow Items category select "Data Flow Task" and drag and drop it onto the control flow design area of your project.

Data Flow task Added to the Control Flow

You can rename the data flow task as you like by simply selecting it and typing. It's usually good practice to give tasks names that have some relevance to the task performed.

Step 5:

Let us now prepare a sample Excel file that will be the source of the data we want to transfer to the table we've created.

Sample excel file structure

Next, double click on the data flow task component you've dragged onto the design area. This will transfer you to the data flow design area, within the scope of the selected data flow task.

Designing the actual data transfer process within the data flow task is very intuitive. We just need to specify where our data is (the source), where we want it to end up (the destination) and what we want to do with it on the way (the transformations).

The toolbox provides us with a number of ready-made components to do exactly this.

In our case the data source is an Excel 2007 file. So, from the Data Flow Sources group, drag and drop the Excel Source component onto the design surface, then double click it to configure it.

Configuring the Excel Source

In the form that appears you have to set up a connection manager for the Excel source, which you can initiate by clicking the New button on the form.

This will lead to an Excel Connection Manager creation form, as follows.

Creating an excel connection manager

The image above is pretty self explanatory, so I'll not go into detail about configuring the connection manager. However, you should note the ticked option "First row has column names" in the form. While in the Excel file we have created the headers are in the first row, this may not always be the case: the headers may come after some other data, such as a date of creation (as in the BI project I am working on). This creates a complication in extracting the data from Excel files, the solution to which will be explained in a future post.

Now, if you've done everything correctly, once you've clicked OK, gone back to the Excel Source Editor, set the data access mode to Table or View and selected the relevant sheet name from the drop down, you should be able to preview the data in the Excel file using the Preview button, as below.

Preview of the Excel file from the Excel Source

Now you must select the columns in the Excel file from which you want to extract data. This can be done by selecting the Columns option in the Excel Source Editor and ticking the check boxes of the relevant columns displayed. This form also provides you with the option of renaming the columns as preferred for use in the data flow task.

Step 6:

Congratulations. Now you've configured your Excel source (unless you forgot to click OK on the Excel Source Editor :)). You should note that the Excel connection manager you created from within the Excel Source Editor is now visible in the connection managers area, from which you can reconfigure it if you need to.

Our next task is a small data transformation. When we extract data from an Excel file, the Excel source examines the first rows of data and decides the data type of each column being extracted. Usually text is taken as Unicode string data, or [DT_WSTR], and numbers as double-precision float, or [DT_R8]. However, in my table I have defined the user and telephone columns as VARCHAR, which is a non-Unicode type. So, in order to insert the data taken from the Excel file, we need to convert it into the non-Unicode string type [DT_STR], which is where the Data Conversion component comes in.

Drag and drop a Data Conversion component from the Data Flow Transformations group in the toolbox. Connect the Excel source to it by selecting and dragging the green arrow sticking out of the Excel source component onto the Data Conversion component. What this does is establish the logical sequence of the process (extract data from the file using the Excel source, then convert).

Now you can configure the Data Conversion component by double clicking it. The form that appears allows you to choose which of the data columns sent to the component should be converted, the conversion data type, and the names of the new data fields created by the conversion. It should be noted that the unconverted original column also remains available for use in subsequent components.

Configuring the data conversion component

Now your data flow design area should look something like this

The data flow design area after adding the data conversion component

Step 7:

Now your data is ready to be transferred to the database.

For this we need to use an appropriate destination component from the Data Flow Destinations group in the toolbox. However, in certain cases there is the option of using the "Slowly Changing Dimension" transformation, which also incorporates a data flow destination. An example of where the Slowly Changing Dimension comes in handy is as follows.

If we consider the data we are using for this post, it's obvious that the users are not going to be changing their names regularly :). But their telephone numbers may change over time, and what we need to do when we reload the data after some time is replace the phone number where it has changed for existing users, rather than adding each user, telephone combination as a separate row, so that we don't end up with duplicate phone numbers (one most likely outdated) for the same user.

However, note that using the name as a reference to update the phone number in the database is inappropriate, because the name may not be unique. It is used here only for simplicity.

OK. Now we can drag and drop a "Slowly Changing Dimension" component from the Data Flow Transformations group in the toolbox and connect the Data Conversion component to it.

Now follow the wizard that appears.

Configuring the Slowly Changing Dimension component

Here, select the connection manager to be the one we created right at the beginning using our shared data source, and set the table to the one into which we intend to enter the data. Next, match the data fields holding the data we want to transfer to the appropriate columns in the database table. (In this case the converted fields are used so as to match the data types; if you use the original columns you got from the Excel file you will get an error at run time.)

The next important thing to do is set the business key(s): a unique field or set of fields that can be used to identify a row (basically a candidate key in database terms). In this case I've chosen the name, though as explained before it may not be very suitable.

Proceed with the wizard. At the next stage you set the options for updating the non-key columns. Set the non-key column telephone to be a changing attribute, which basically means that if the telephone field for a particular business key were to change, it would be updated in place and no new row would be added.


Setting the non key field attribute type

In the next step of the wizard, tick the check box under the Changing Attributes heading as follows, and in the next stage leave the check box for inferring attributes unchecked.

Once you complete the wizard, the Slowly Changing Dimension will automatically create two components, as shown in the image below, to handle the two types of database operations to be carried out (updating existing rows and inserting new ones).

Components automatically generated by Slowly changing dimension

Step 8:

Now your package is complete and ready for testing. In the solution explorer, select the package file and right click. From the menu select "Execute Package" and the package will begin executing. As each component or task in the control flow or the data flow begins executing, its fill color will turn yellow, and once completed the component will turn green. If an error occurs in a component it will turn red, and based on the default error-handling properties the entire package may fail. You can find the error by moving the mouse over the offending component, or find detailed error information in the Execution Results/Progress tab.

In this case, of course, you will not encounter an error; the package will execute smoothly, and once execution is complete the data flow area of the DF task will look like this.

The appearance of the DF task on completion of execution

Now let's go take a look at the "Users" data table in the BlogDatabase.

Data table with inserted data

Conclusion

In this post I described how to create a very simple ETL package to transfer data from an Excel file to a database using Microsoft SQL Server Integration Services, at a level of detail I hope is sufficient for a first timer. In future posts on the area I intend to extend the same example and most likely address specific issues (and therefore be a lot shorter and sweeter :)).