Hello, how can I find out the highest and lowest expression values in a large matrix of single cell gene expression data whose number of columns and rows is greater than 1000?
You could Sort the file by a specific column. Consider removing any header lines first, as you can always add one back in (exact or just parts of it) later on and some tools might complain about it.
This tutorial has a similar method described.
And this tutorial includes a bunch of data manipulation methods. Most are “UNIX utility” or “spreadsheet functions” analogues and definitely work on larger tabular text files.
Hope that helps!
Thank you for your guidance