I have 700k-1M rows coming in with a comma delimited field (among others). I need to keep this column intact, but sort the rows so that I can determine duplicates. In this business case, a,3,null,40 is the same as null,40,a,3 and so on.
This can be achieved via stored proc, script component, tsql.
I found this component which does exactly what I need, I just don't have the knowledge for how to pull开发者_Python百科 the script out or how to compile it into a component!
http://sortdedupdelimstring.codeplex.com/
Any help is very much appreciated.
Please let me know if I was unclear anywhere.
The project is a Visual Studio 2005 Visual Basic project. It is for SQL Server 2005.
Load the project into VS and compile (F7), it will compile and install the component into SSIS for you. Create an SSIS project, click the Data Flow tab, and right-click anywhere on the Toolbox, select 'Choose Items...'. Go to the 'SSIS Data Flow Items' tab, locate the 'SortDeDuplicateDelimitedString' component and check the box. Now the component will appear in the 'Data Flow Transformations' list, and you can use it like you would any other component.
精彩评论