Hello
I have files in different formats which I'm
trying to prune down to get rid of
equivalent strings.
For example, a tab-delimited file contains
the following pairs:
a b
c b
c a
d e
In this instance I would like an output file that
contains just one of a, b or c (they are all
equivalent, because a finds b, c finds b
and c finds a) and one of d or e. The output
could contain a and d, for example.
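
To make this first case concrete, here is a minimal Python sketch. It assumes that every line holds exactly two tab-separated strings and that equivalence is transitive (if a finds b and c finds b, then a, b and c all belong to one group). The function name group_representatives is made up for illustration, and this rule only fits files like the one above, where every member of a group finds every other member.

```python
def group_representatives(lines):
    """Return one string per group of linked strings, in first-seen order."""
    parent = {}  # union-find forest: string -> parent string

    def find(name):
        # Register unseen names as their own root, then walk up to the root.
        parent.setdefault(name, name)
        while parent[name] != name:
            parent[name] = parent[parent[name]]  # path halving
            name = parent[name]
        return name

    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 2:
            continue  # assumption: skip blank or malformed lines
        root_left, root_right = find(fields[0]), find(fields[1])
        if root_left != root_right:
            parent[root_right] = root_left  # merge the two groups

    seen_roots = set()
    representatives = []
    for name in list(parent):  # dicts preserve insertion order
        root = find(name)
        if root not in seen_roots:
            seen_roots.add(root)
            representatives.append(name)  # first-seen member of its group
    return representatives


if __name__ == "__main__":
    pairs = ["a\tb", "c\tb", "c\ta", "d\te"]
    print("\n".join(group_representatives(pairs)))
```

Each group is reduced to whichever of its members appears first in the file, so the four lines above give a and d.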
However, another file format that I have is:
a b
c d
e c
In this case I want to output either a or b,
and both d and e, so the file would contain a, d
and e, for example. c appears on more than one
line, but d and e don't find each other, so
they aren't equivalent to each other.
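
One possible reading of this second case, sketched in Python: two strings count as equivalent only if they appear together on a line, and the output should contain no two strings that share a line while every dropped string shares a line with a kept one. The sketch below is a greedy heuristic under that assumption, not a guaranteed best answer. It visits the strings with the fewest partners first, so a string like c that links otherwise unrelated strings is the one that gets dropped. The function name prune_pairwise is made up for illustration, and each line is assumed to hold exactly two tab-separated strings.

```python
def prune_pairwise(lines):
    """Keep a set of strings in which no two kept strings share a line."""
    partners = {}  # string -> set of strings it appears on a line with
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 2:
            continue  # assumption: skip blank or malformed lines
        left, right = fields
        partners.setdefault(left, set())
        partners.setdefault(right, set())
        if left != right:
            partners[left].add(right)
            partners[right].add(left)

    kept = set()
    # Visit strings with the fewest partners first; sorted() is stable,
    # so ties are resolved in first-seen order.
    for name in sorted(partners, key=lambda n: len(partners[n])):
        if partners[name].isdisjoint(kept):
            kept.add(name)  # none of its partners has been kept yet

    # Report the kept strings in the order they first appeared.
    return [name for name in partners if name in kept]


if __name__ == "__main__":
    pairs = ["a\tb", "c\td", "e\tc"]
    print("\n".join(prune_pairwise(pairs)))
```

For the three lines above this keeps a, d and e. For a file where every member of a group finds every other member (a b, c b, c a, d e) it keeps a and d.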
In some of my files a group can contain any
number of equivalent strings, listed in all
their different pairwise combinations.
Thanks for any help,
Gayle