Posts

Showing posts from May, 2015

SSIS Remove Duplicate Rows Using Fuzzy Grouping (SSIS Transformations)

Image
SSIS Remove Duplicate Rows Using Fuzzy Grouping (SSIS Transformations) Source data may have the duplicate rows which needs to be removed as part of data cleansing task. Fuzzy Grouping   is one of the transformation in Data Flow Transformations which can be used to group the similar rows in the source stream line. This transformation performs data cleaning tasks by identifying rows of data that are likely to be duplicate. Lets generate some of duplicate records as shown in below screen:- Create one DataFlow task inside which Drag OLE DB Source component and write the query shown above which will become as source data which has duplicate records.   Drag the Fuzzy Group component and open the editor. Go to the Columns tab, and check the checkbox agist the column which you want to analyze of similarity. after that go to the advanced tab. Here we can define Similarity threshold. Important values in aboe screenshot are _key_in and _key_out in our example as we ar

Fuzzy Grouping Transformation Sample

Image
 Fuzzy Grouping Transformation Fuzzy Grouping Transformation Use of fuzzy grouping I wanted to learn, how to use Fuzzy grouping transformation. But I was not getting any scenario, where this transformation could be used. Then while going through Google search, I got some scenario in which this transformation can be used. The scenarios are as follows: ·           Scenario1 : Say there are various clothes shops in a market. Some sales man sells cloths in shops. One wants to get the salesman, who has sold maximum clothes. For this the records of sale for each salesman in shop1 and shop2 is as follows: Shop1:                                                                  shop2:                          But there are some names, which misspelled .For getting the total sale of each salesman, we will need to use fuzzy grouping, which will group the names which are similar and will assign a common name to the similar names, so after that we can aggregate the sales of

Solution: Reporting Services permissions are insufficient for performing operation (rsAccessDenied)

Here's what to do:  Make sure you have access configured to the URL  http://localhost/reports  using the SQL Reporting Services Configuration. To do this: Open Reporting Services Configuration Manager -> then connect to the report server instance  -> then click on Report Manager URL. In the Report Manager URL page, click the Advanced button -> then in the Multiple Identities for Report Manager, click Add. In the Add a Report Manager HTTP URL popup box, select Host Header and type in: localhost Click OK to save your changes. Now start/ run Internet Explorer using  Run as Administator ...     NOTE: If you don't see the 'Site Settings' link in the top left corner while at  http://localhost/reports  it is probably because you aren't running IE as an Administator or you haven't assigned your computers 'domain\username' to the reporting services roles, see how to do this in the next few steps. Then go to: http://localhost/reports   (you