Monarch geneset OGS2.0

DPOGS207297
TranscriptDPOGS207297-TA2133 bp
ProteinDPOGS207297-PA710 aa
Genomic positionDPSCF300008 + 709709-712260
RNAseq coverage77x (Rank: top 65%)
Annotation
HeliconiusHMEL0021790.060.25% 
BombyxBGIBMGA012028-TA6e-8445.63% 
DrosophilaCG4611-PA1e-7248.74% 
EBI UniRef50UniRef50_D6WG401e-11538.18%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WG40_TRICA
NCBI RefSeqXP_001657591.13e-13439.97%hypothetical protein AaeL_AAEL006227 [Aedes aegypti]
NCBI nr blastpgi|1571126185e-13339.97%hypothetical protein AaeL_AAEL006227 [Aedes aegypti]
NCBI nr blastxgi|1571126182e-12839.97%hypothetical protein AaeL_AAEL006227 [Aedes aegypti]
Group
KEGG pathway 
InterPro domain[134-167] IPR0028851.2e-06Pentatricopeptide repeat
Orthology groupMCL11280 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207297-TA
ATGGCATTTCGTTGCTTAAACTTACTGAGAAATTTAAAGAACAACAGTTCAACTATACTACGACAAAGTTTATTATCTACAACTACTATAACGAATATAACTCAACAAAAAATACAAAACATCAAAAAATATAAAACTGAAAATAACATCACATATTTAGAAGATCCAGACACCTTTGGAACCTTGTCTGGACAAAAAGTCATCAAAGAAGCATTAGAAGATGAAGGAGATATACAGGAAGAAAAGTATTTGCAAGAACAACCATTAAAATCTCAAAAGTTAACTATTAAACAGTATGCAGACATAATAAAGCAATATTTACAACATAAAAGGATCAAAGAAGCTATTGATGTTTTAGAAACACGTATGCTTAAAGAAGACAGAGTTAAACCTGAAAATTACATTTATAATATACTTATTGGTGCATGTGCTGAGGTTGGATATACAAAGAAAGCATTCCAACTATATAATGATATGAAACGACGCGCATTACGCCCCACAGGTGATACATACACTTGTCTCTTTGAGTCATGCATAAATAGCCCATATCCCACATATGGATTGAAAATGGCTACCCACCTTCGAAACCTTATGATAGAAAAAAATATTGAGCCTAATTTAACTAATTATAATGTTATGATCAAAGCATTTGGTAGATGTGCAGATCTACAAACAGCTTTCAAAATTGTTGATGAAATGATATCAAAAAAAATCAAGATTAGATCCCACACCTTCAATCACCTGCTGCAGGCTTGTATAACAGATAAAAATCACGGTTTGAAATATGCTCTCATCGTTTGGAGGAAAATGTTGAACATGAAGGAGAAACCTAATCTATATTCATTTAATCTAATGTTGAGATGTGTTAAAGATTGTAATGTTGGTTCTAAAGAGGATCTTCTAGAAGTTATTGGAATCATTCAGGCAAGTTTACCAATAAGATCTGTAGAAAATTTAAAAGAAATAGAAGGACAGCAACAAAGATTGTTATCTGGTGCTCCAAATGATAGAAAGATGACTGAAAACCAATCATTTGTTTATTCATCTAGTGATTCGGAAAGGAATATTATCACTGAAGCAAATGAAAAACATGTTAAACCAACAGACTTCTCAAAAAGCCATTCTGATGGAAAGTGTTCATCAACAACAGATCCTACAGATTTAGAACTAGTTACTCAAGATCAAAACACATTAGAAGTTATTGAAAATTATAAAAAAACAACACCTCTGCCTAAGAGATATGCACCCAACTTACTTTCAAGAGTCGTACAAATAGATCAGGTGCTAGCGTTCCAAGATGTCAGTACATCACAAGAAAAATTTGCAATAATAGGGGGACAAGAAGATTTTCTCAATGAAATGGAAGTTTATTCTATTAAGCCAGACATTAAAACCTTCACACAGATGTTGCCTCTCATAGAAAACAGTACAGAAGCTGAAATTAAACTAATAGATACAATGAAAACATTGAAAATTAAAAGGGATATAGATTTTTATAATATGTTAATTAAGAAAAGGTGTCTCAGAAAGGACTATGACAGTGCCTTTATGGTCAGGAATTTGATAGAAGAGGATAATGCAAGTGTGCGGAAACATCCATTCAATAAAAAACATAAATTAAAAGTTGATGTAATGACATATGGCGCTTTAGCGTTAGCGTGCACTACAAGGGAAATGGCCGACAAATTACTTAACGAGATGAAGGAAAAACAATTGAAAGTTAACATTGAAATGTTAGGAGCATTATTAAAAAATGCTGCCATTAATATGCAATTTGGATATGTACTTTATGTTATGGATATAGTTAAACAAGAAAAACTAAAGGTCAACACTGCCTTTTTAAGACATTTGGAGACTTTCAATGACAGATGTATAAGAAGTATAGAGAAGAATGAAAAAGAGAAGAGGGACAGTCCAGTACTCAAGGCGGCTTATAACAGGTTTAGTGACATTTATAATAATTGGATAACAGATGTGAATGTGGAAGAAGCCTTGAAACCTGAAAATCCATGGAAGCAATTCACAGAACCCCATCCCGCTACAGTACAGAGAGAAAATTTCCAAATAGTTGAACCAAAAAGGTTTTATAAGAAAAAACGTCACTACAAACCATATACGCCAAGGTTGAAATAA

Protein sequence:

>DPOGS207297-PA
MAFRCLNLLRNLKNNSSTILRQSLLSTTTITNITQQKIQNIKKYKTENNITYLEDPDTFGTLSGQKVIKEALEDEGDIQEEKYLQEQPLKSQKLTIKQYADIIKQYLQHKRIKEAIDVLETRMLKEDRVKPENYIYNILIGACAEVGYTKKAFQLYNDMKRRALRPTGDTYTCLFESCINSPYPTYGLKMATHLRNLMIEKNIEPNLTNYNVMIKAFGRCADLQTAFKIVDEMISKKIKIRSHTFNHLLQACITDKNHGLKYALIVWRKMLNMKEKPNLYSFNLMLRCVKDCNVGSKEDLLEVIGIIQASLPIRSVENLKEIEGQQQRLLSGAPNDRKMTENQSFVYSSSDSERNIITEANEKHVKPTDFSKSHSDGKCSSTTDPTDLELVTQDQNTLEVIENYKKTTPLPKRYAPNLLSRVVQIDQVLAFQDVSTSQEKFAIIGGQEDFLNEMEVYSIKPDIKTFTQMLPLIENSTEAEIKLIDTMKTLKIKRDIDFYNMLIKKRCLRKDYDSAFMVRNLIEEDNASVRKHPFNKKHKLKVDVMTYGALALACTTREMADKLLNEMKEKQLKVNIEMLGALLKNAAINMQFGYVLYVMDIVKQEKLKVNTAFLRHLETFNDRCIRSIEKNEKEKRDSPVLKAAYNRFSDIYNNWITDVNVEEALKPENPWKQFTEPHPATVQRENFQIVEPKRFYKKKRHYKPYTPRLK-