Computing.Net > Forums > Unix > sort on 60M lines

Computer Problems? Computing.Net has over 1,000,000 posts about all things technology related! Over 90% answered within 24 hours! Click here to start participating now! Also, be sure to check out the New User Guide.

sort on 60M lines

Reply to Message Icon

Name: dips
Date: October 9, 2007 at 22:20:39 Pacific
OS: sunos.5.8/8.7
CPU/Ram: 512 MB
Product: Pentium (4)
Comment:

A flat file with 60 million lines needs to be sorted out (and then uniq to be done on it). I want to know that if anybody knows how fast it would be? coz right now we are handling that in our ETL tool (Informatica) because of sorting such a huge amount of data cache fails.
Doing sort and then uniq will help?

dips



Sponsored Link
Ads by Google

Response Number 1
Name: fpmurphy
Date: October 15, 2007 at 16:58:30 Pacific
Reply:

Using the standard sort and uniq utilities to sort 60M lines on a UNIX box will be difficult for a number of reasons. The sort utility typically creates a temporary file and an output file - each of which will be at least the same size as the original file. This means that you need plenty of free disk space.

I suggest that you instead look at some third party application such as CoSort.


0
Reply to Message Icon

Related Posts

See More







Post Locked

This post is quite old and has been locked from receiving new replies. Please create a new posting instead.


Go to Unix Forum Home


Sponsored links

Ads by Google


Results for: sort on 60M lines

Substituting on the line above the match www.computing.net/answers/unix/substituting-on-the-line-above-the-match/2931.html

creating a report www.computing.net/answers/unix/creating-a-report/2561.html

sort single line input www.computing.net/answers/unix/sort-single-line-input/2554.html