Monarch geneset OGS2.0

DPOGS210418
Transcript: DPOGS210418-TA (1914 bp)
Protein: DPOGS210418-PA (637 aa)
Genomic position: DPSCF300062 - 518247-525046
RNAseq coverage: 752x (Rank: top 17%)
Annotation
Heliconius: HMEL021604, 7e-115, 75.97%
Bombyx: BGIBMGA008874-TA, 3e-14, 27.48%
Drosophila: casp-PA, 2e-112, 39.97%
EBI UniRef50: UniRef50_D6WR39, 4e-158, 46.40%, Fas (TNFRSF6) associated factor 1 n=2 Tax=Tribolium castaneum RepID=D6WR39_TRICA
NCBI RefSeq: XP_975449.2, 3e-154, 45.65%, PREDICTED: similar to FAS-associated factor 1, putative [Tribolium castaneum]
NCBI nr blastp: gi|270011090, 2e-157, 46.40%, Fas (TNFRSF6) associated factor 1 [Tribolium castaneum]
NCBI nr blastx: gi|270011090, 1e-152, 46.40%, Fas (TNFRSF6) associated factor 1 [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, 2.7e-11, protein binding
KEGG pathway: cel:C28G1.1, 9e-13
 K04649 (HIP2, UBC1), maps to: Ubiquitin mediated proteolysis
InterPro domain: [319-464] IPR006577, 9.1e-28, UAS
 [556-636] IPR001012, 2.7e-11, UBX
 [1-47] IPR009060, 2e-08, UBA-like
Orthology group: MCL12670, Single-copy universal gene

Nucleotide sequence:

>DPOGS210418-TA
ATGGCTGAGAATAGAGAAGAAATTCTTGCGAATTTTCAAGGAATAACAAATATTGAAGATGTTGCTGAAGCAATATTCCATTTAGAAGAAGCAAATTGGGATTTACTTTCTGCCATAAACCGAGTGATGCCTCAAGATGGAAGCAGCTCAGCATCCCATTCCAATGACACTCACGATGTGGAGATGATAGATGACGACATATCTGTGATAACTCCAAAAACTCACCCACCCGATCGGGAAGACAGTCAAGCCTCAACATCATCCCCTAGAAATAATTCCCTCAACCTAGTAGAGTTACAAGTACATTGTAATAATAAGATACACGAGATCAAAATATCAGGTTCAGCGACAGTCAATGATCTCAAGAAGCGTTTGGAACTGGTATGTGGAATTCCAGTTTGCAGACAGCAAATATCTGGTCTAGGTGGTTCCAGAGCGACCTCCACTGCTGCATTGTCGACACTCGGTCTAACAAGGAATGCTGTTCTACGGCTGAAGGCTGCCGATACTGTCATGGCTGATGATGAGGTAGCGGAAAGATTGACAACGACCTACGTACTCCGAGTTAAGCATGACGAGAGGGAATACACACTCAAATATCCTGGAACGAAAACTGTACAAGAAGTCAAAAACGACCTCTACTCACTCACAGATATACCCGTCAGATATCAAGTATGGACGGGCTGGCCGACTGTGTCAGGTCTGGATGACACGGTGTTAGCTATGGTCGGTCTAGATCTACCACAGCATGAGCTCACAGTCAAAAGAGCACCCAGATTCAAGGAGTACAAAAGAATTATAGTTGAAAGTTCAGATAGTGAGAACAGTTCGGTTGAAGAAGTAGAAGACGGTGACAATTTCCCAACGGAGGATGATATGTTTGTCGATGTAACCTCGGAAAGACTGCAGCCATTGATGTCGGATAACATTGAAGATGAAGCCCTCGGCTGTATAGAGTTCTCACAACGGTTCCGAGCCCGCTACGGTCCTAACACACCTAATTTCTTCGAGGGTACCCTACATGATGCAATCAAAGAATCCTGTCTCAAACCAGCCAATGAGCGTAAGCTGTTAGGTGTGTACCTGCATCACGAGCAGTCTGTGTTGTCTAACGTGTTCTGTGCACAGCTTCTGGGATGTGAAACTGTGCTGCAAACCCTCGCAGCTAACTTTGTGCTATACGGCTGGGATCTCACACATCCACATAATAATAATATGTTGTTGTCATCTATAGCCAGTTCCCTGGGTCCTGTCGCCAGTATGACTGTCCGCAGCATCCCAGTGGAGCGTCTCCCCGCCTTGCTCGTCATCATGAGAGTGAGATCCAACACCGAAATATACTCCGTCATTAATGGTAACGTGGGCGTGTCTGAGCTGGTTGGTGGTTTAGTGGAAGCGCTTGAGAGGTTCGCCGTGCAAAGGGCGGAGGACGCGAGGGTCGAGAGGGAGCGGGACGCGCGGCAGAAGGTCAAGAGAGAGCAGGACGAGGCCTACCAGCGGAGTCTAGAGGCGGATAGAGCGAAAGAAGAAATAAAAAAACAGCAGGAACTAGAAAGAAATCAAGAATTAGAGAGAGCGGAGTTAGAAAGACTTATGGAGGAGGCCAAAAAAGAGGAGCAGCGTGCCGGTGCCGCAGCCCGAGTGCCGTGTGAACCTGCGGCGGGTGCTGCGGACGTGGCCCGCGTGCGAGTGAGACTGCCGCCGCCCCACCACGAGTGTCTCGAGCGACGCTTCAACGCCACTGACACGCTCGCGGCGTTGTTAGACTTCCTCGCCTCAAAGGGTTACCCACAGGAGAACTACAAAGTAATAGCTAGCTGGCCTAGAAGAGACCTGACGATGGAATCTCACAGCAGCACATTGAAAGCGTTAAAGCTGTATCCGCAGGAGACGGTTATGTTGGAGGAGCGGTGA

Protein sequence:

>DPOGS210418-PA
MAENREEILANFQGITNIEDVAEAIFHLEEANWDLLSAINRVMPQDGSSSASHSNDTHDVEMIDDDISVITPKTHPPDREDSQASTSSPRNNSLNLVELQVHCNNKIHEIKISGSATVNDLKKRLELVCGIPVCRQQISGLGGSRATSTAALSTLGLTRNAVLRLKAADTVMADDEVAERLTTTYVLRVKHDEREYTLKYPGTKTVQEVKNDLYSLTDIPVRYQVWTGWPTVSGLDDTVLAMVGLDLPQHELTVKRAPRFKEYKRIIVESSDSENSSVEEVEDGDNFPTEDDMFVDVTSERLQPLMSDNIEDEALGCIEFSQRFRARYGPNTPNFFEGTLHDAIKESCLKPANERKLLGVYLHHEQSVLSNVFCAQLLGCETVLQTLAANFVLYGWDLTHPHNNNMLLSSIASSLGPVASMTVRSIPVERLPALLVIMRVRSNTEIYSVINGNVGVSELVGGLVEALERFAVQRAEDARVERERDARQKVKREQDEAYQRSLEADRAKEEIKKQQELERNQELERAELERLMEEAKKEEQRAGAAARVPCEPAAGAADVARVRVRLPPPHHECLERRFNATDTLAALLDFLASKGYPQENYKVIASWPRRDLTMESHSSTLKALKLYPQETVMLEER-
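
Length check:

The transcript and protein lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the transcript entry is coding sequence only (it starts with ATG and ends with TGA, with no UTR) and that the trailing "-" in the protein sequence marks the stop codon. The function name cds_matches_protein is hypothetical and is not part of any MonarchBase tooling.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int) -> bool:
    """Check that a CDS length agrees with a protein length.

    Assumes the transcript is pure coding sequence, so its length
    must be a multiple of 3 and must equal (residues + 1 stop codon) * 3.
    """
    if transcript_bp % 3 != 0:
        # A complete CDS cannot contain a partial codon.
        return False
    codons = transcript_bp // 3
    # One codon per residue, plus the terminal stop codon.
    return codons == protein_aa + 1


# Values taken from this record: 1914 bp transcript, 637 aa protein.
print(cds_matches_protein(1914, 637))  # 1914 / 3 = 638 codons = 637 aa + stop
```

Under these assumptions the two reported lengths are consistent: 1914 bp gives 638 codons, which is 637 residues plus one stop codon.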