strings: Sort output to deal with false positives
Created by: xTibor
The problem:
Running strings
on files with embedded media resources (GUI executables, video game ROMs, etc.) yields a lot of false positives. That garbage data makes harder to spot meaningful strings in the output.
Possible solution:
strings
could sort its output by doing an N-gram frequency analysis on each of the results to determine their meaningfulness and use that metric as a sort key. So English-like results would rise to the top and garbage would sink to the bottom of the output.