getting 9 char from a file...
|
Original Message
|
Name: bengalliboy
Date: August 17, 2004 at 14:28:45 Pacific
Subject: getting 9 char from a file...OS: UNIXCPU/Ram: 450 |
Comment: I would appreciate, if some one help me on this. I have a flat file (fixed length) and I have got position 20 thru 28 SSN number (9char). I need to take all the SSN number and eleimnate duplicate and put it into a new output file. Please note this is a huge file. So I am looking the most efficient way to do this. Thanks again
Report Offensive Message For Removal
|
|
Response Number 1
|
Name: nails
Date: August 17, 2004 at 15:53:49 Pacific
Subject: getting 9 char from a file... |
Reply: (edit)Hi: I'm assuming you want only the 9 characters in a new file. If your datafile is already sorted: cut -c20-28 datafile|uniq > newfile If your datafile is not sorted: cut -c20-28 datafile|sort -u > newfile Regards, Nails
Report Offensive Follow Up For Removal
|
|
Response Number 2
|
Name: thepubba
Date: August 17, 2004 at 18:36:21 Pacific
Subject: getting 9 char from a file... |
Reply: (edit)Try: sort -u -k20,28 inFile > outFile I assume that you don't want a listing of sorted SSAN's. You want a list sorted by SSAN. The above solution will provide you with a sorted list of the values in columns 20-28. If that is what you want, then that is a good solution. You can also try: cut -c20-28 inFile | sort -u > outFile if all you want is a sorted list of SSAN's.
Report Offensive Follow Up For Removal
|
|
Response Number 3
|
Name: bengalliboy
Date: August 18, 2004 at 06:19:47 Pacific
Subject: getting 9 char from a file... |
Reply: (edit)Thanks a lot! This really worked. Now another related question. Now that I got my unique key into a file, I want to take that key and go against multiple files and extract record from them. Note, the files that I am trying to match may contain my key in a fixed position. Say, for file A the key is from 12 to 20 and for file B it is in 0 to 9. I know I can use grep agains the key file for matching some thing like: fgrep -f record_key data_out >data_updated but how do I go against maching those particular fields. Also is there an easy way to go against both the files same time ? Many thnaks again!!!
Report Offensive Follow Up For Removal
|
|
Response Number 4
|
Name: Jim Boothe
Date: August 18, 2004 at 09:09:41 Pacific
Subject: getting 9 char from a file... |
Reply: (edit)See my solution to this last question in your newer posting "Matching files against the key".
Report Offensive Follow Up For Removal
|
Use following form to reply to current message: