|Title||»||Add gene name property to proteins and UI to support it|
At some point we should add a gene name property to proteins, both in the document tree and in the background proteome table. We would need some way to extract gene names from FASTA headers based on regular expressions. This might be done individually for each FASTA file when it is:
- Added to a background proteome
- Imported through File / Import / FASTA
- Pasted into the document
Or we could add a Settings / Options form, with a tab for a list of regular expressions to try on every FASTA header, e.g.
" GN=([^ ]+) "
" GENE_SYMBOL=([A-Z][a-z]+) "
We could pre-populate this list with expressions we know of, and the user could add any formats specific to their work.
When we have gene symbols we could use them in the Edit / Unique Peptides form. People have asked for this to help look for peptides that are unique to a gene when that gene may have multiple isoforms.
We can also add links in some places to allow getting gene information on the web.