sort-uniq-count-rank

As new Insider's Guide classes are no longer being offered, this site is not currently being updated. Please refer to NCBI's E-utilities documentation for more up-to-date information.

The sort-uniq-count-rank command combines a series of Unix commands (including sort and uniq) to sort a series of lines of text alphabetically, grouping together the duplicate lines. The function then counts how many occurrences there are of each unique line, removes all duplicate lines, and then sorts the list of unique lines by how frequently they occur, from most frequent to least frequent. The numerical frequency count is also displayed.

Noy S
Iverson GL
Del Bigio MR
Sawyer Q
Vesci B
Zukerman SL
Iverson GL
Wallace PW
Stern RA
Noy S
Baugh CM
[...]

The sort-uniq-count-rank command will output:

12      Iverson GL
12      Zuckerman SL
11      Guskiewicz KM
11      Kerr ZY
11      Solomon GS
10      Covassin T
10      Stern RA
9       Arnett PA
9       Baugh CM
9       Kroshus E
9       Mihalik JP
[...]

xtract: Storing and retrieving information with variables word-at-a-time

Last Reviewed: July 30, 2021

The Insider's Guide to Accessing NLM Data

sort-uniq-count-rank

Input

Output

Arguments

Example