how to replace these ID with official gene names?

Hi -

Since the identifier is in the first column, Text transformation with sed will work without a complex expression. Try this:

s/\.[0-9]+//

Other options include using regular expressions with tools like:

  • Text reformatting with awk
  • Replace parts of text
  • Replace Text in entire line
  • Replace Text in a specific column

Or, if you don’t want to use a regular expression, the column can be isolated, the “dot” replaced with a “tab”, then all the data rearranged back again into one file. This could be put into a workflow if you plan to run it again. Example tool order: Cut > Convert delimiters to TAB > Paste > Cut.

Also, your IDs are Emsembl transcripts, so choose that as the input type with annotateMyIDs.

Hope that helps!

2 Likes