This is really a small, trivial task for a perl script. Even with a billion lines this is nothing for a modern cpu and perl.
https://www.reddit.com/r/perl/comments/18ygpsi/1_billion_row...