Monarch geneset OGS2.0

DPOGS214227
TranscriptDPOGS214227-TA5301 bp
ProteinDPOGS214227-PA1766 aa
Genomic positionDPSCF300014 + 854431-865462
RNAseq coverage157x (Rank: top 52%)
Annotation
HeliconiusHMEL0156330.040.77% 
BombyxBGIBMGA005950-TA0.039.85% 
DrosophilaCG42588-PA2e-1122.47% 
EBI UniRef50UniRef50_D6WP059e-4326.97%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WP05_TRICA
NCBI RefSeqXP_001948077.15e-4223.11%PREDICTED: similar to general transcription factor IIIC, polypeptide 2, beta 110kDa [Acyrthosiphon pisum]
NCBI nr blastpgi|2700079693e-4226.97%hypothetical protein TcasGA2_TC014717 [Tribolium castaneum]
NCBI nr blastxgi|2700079692e-4325.99%hypothetical protein TcasGA2_TC014717 [Tribolium castaneum]
Group
KEGG pathwayecb:1000712108e-14 
 K13348 (MPV17)maps-> Peroxisome
Orthology groupMCL25187 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214227-TA
ATGGAATCTACAAAGTTACTTCAGAATGACCCTCCAGAAGTTATGGTATTACCACCTAACATTTTAGAAAAATTGGGCATTGATTTAGGAAATATTCAGCCCTGTAGTAATGGAACAAATGCAAAATCAGGTACCAGACAACCTGAAGACATATCTGTATTGGATATCAATGCACCCAATTATGTACCTGGTAAAAATAATGATGAAGAATTTGATGGATCTTCGATTTTAGAAACAGCTAATTTGATGAAGGCATCAGTTTTCTTAAGTCCCACATTTACTAATTTGAAGGCAACCAACGGTTTAAATGATCTCGAGTTCAATAATTTTGAAAACGCTATAGCCACCATAGAGTGCAATGTTTCTAAACCTTTAAAAAGTAATAGCCCAAAAAACAATAGAGTAGTTAATATAACTTTAGAAAATAAATCAAACAGTTCCAACGGTGTGCAGAAAAATAATGCCAGAGGAAAACTACAAAAGTCAAATAATAAAATAAACATAATATCTGAAGAAATTATTGAGTTGGAGAAAATTAAAGATTTAATAACATCTGAGATACTGGTTAGCAATAATCTTGCTTCTGTGCCTATTAACACAATAATTACAGAAAAAACTATTAATATCAAAAATGGTGCAGCTTCAGTACCATCAAAGGAGAAAAATAATAATGATTGTAAGGATAACAATGATAAACCATTAAATGATTGTAAAGCAGTAGAAAATAAAGTTGCCAACATAAATAACGGAATTCAAGATCAATTGGACAAATCAATTCAGTTCCAAATAGATAGCGATGGTTTAAAGTTATCAAAGGAAATAAATTTGATAAATAATACAAATGGAGATATAATTATGTACAAGAATCATGAAGAAAAATGTTCAGATGAAAACCAGGATCTTGTGAGAAATTGTGACAAAGATTTAAAGATCACGTCTGAGCAGCAAGATGAACATGCTACTTCTAGTTCAGAAGAAATTGTAACTATTATTCATGTTATAAATGAAAATGTTGATGAGGCAGATGTGAGTAAAGATTTAAATTTCTCCATTCCTGATAGTGTATGTCACTTCACTGTCACAGACACAATTGATATAGTAGTTCCCGAACATTCCGAACTTAGCATTAACTATGATCAGGAAAATAAATCTTGTGATGTAAAAACTTCAGCAGACGAAAAAATTTTAGGCGACAGAAAAGATGCCAAAATTAATAACAAGAACCTAGAAATAAGTACTAGAAATAAAATTATAAATGTCAATCAAAACAAAGTTAAATCAAAGCTGTGTATACAGGGTGTTGAGAAACCCGGTGGAGATGATGATAATGTAAACAAAGATTTTGGTTATTTTTTCTACTACGACAAAGATGATGATTATCAACATATACATTATTGGGAGCAAAAAGGTGCTGAATGTTCTCTAGATACATTACTTAGTATTTATGAAGATAACAAAGTATATGATGTAAATATAAACGAGCCTGAGTATAATGAAATAGAAACAGTCAAAGAAGACGACCAAGAACAGAACAGTACAGATGTTTGCAATATATGTTGGATGAGTGTAAATAGTGATATTGAAAATGAAGATTTAGTCGAAATTAATAAAGAAGCTGAAACCCGTCAAATAACTCAATCTATTGAACCACAGGTTAAAAATACTGATGAAGGAAAAAGAAACAAACCATTACATTCTCAAGATGTTAATAGAGAATTGCAAGATAAGAGAAAAAATTCTTTATCATTAAACGATGTTAGTCCAGAAGAAACTAAGAAGAAAAAGATGGAAGATGTTATAGAAAATAGAAATATTTCGTGTGGTTTATGTAAAGCTAAGGTGGTAACTAGTGAATGGGATAATCATGTTAGGGATCATTGTTTTATAGCCTGGGAAGAGGGTCAGAAATTTGATTTCAATAATGATCTTCTATGTGATCGACTCAAGCAACATTTGAATGATAACGGAAAATTAGTGTGCGTGATATGTGATAAAGTTATTAAGCGGGTGAAGAAATTTTTAAGCCATGTGGAAGGATGTATTTTATATGGTCACGTTGAACAATCGGATGATTTAAAAAATGTTATGTCGCGTCAATCGAAACACACAGTATGTGGAGTTTGCAAGACACAAGTCCCCAAAAGCTCATGGATAGAACATATAGCTAAAGAGCATAATTATATAGCTTGGAAAGATGGAGATAAAGTATTGGATGTTGGAGATGAAGAAGCAGTTAAAAGGCATTTGAATAAGTTGGCTAGTGATGTTAAAAGATTCGAGTGCAAATCCTGCGGCACAAAACGCAAGTGTTCGGACTCTTTCTTTAAACATATTCAAAAATGTGGAAAATTAGGCGAAGGAATGGATACAGCAGACAATTCGGTAGGAGCTGAAGACGATACGTCAGTAACATGTGGAGTGTGTCAGAACAAGATGTCAGCTAACGAATGGCAAAACCACCAATTCAAAGAACACAAATATCTAGCGTGGAAAGCTGGGGAACAAGAGTTGGATCTCAATGACACTGAGCATGTGTATTCACATTTATATAATCTGTCCAAAGATCTCGGCGGACTTCTATGCAGTAAGTGCGGGTGTCGACGCAAGTATGTTAATTCATATTTGAAGCACATTGAGAAATGTGATGGGGAAGAGAATCCAAACTGTACATTGGATTCCACTATGAACGAAAGTAATGTGTCAATAAACAAAACCTTGGGAGACGATGTGACTGGAGACTTAGAGGGAATTGTTAGATGTGGAGTGTGCTCAAAGGAAATTGATAGAAAGCAATGGGAGAATCATATAAAAAAAGAACACTTCTATAAGGCGTGGCAGGAAGGACAGAAGCCAGTGAATTTAGATAACGAAGAGGATGTGTATAACCATCTGTATGCGATGAGTAAGAAATATAAGGGTCTGGTTTGCAACAACTGCGGCACCAACAGGAAATATGTTAAGACATTCCTCCAACATATAGAGTCCTGTAACTCCCAGGACTCGTTTATCACAGATGAAGTTCTTAAACAAGAAACCTGTAAATGCGGTGTTTGCGGCGAAGAAGTTCCAAGTAAAATGTGGAAGACCCATGCTATGAAGACACATTACAACGTGGCGTGGCTGGATCAACAGACACCTATTGATACCAACAATGGAACAGCGGTTGAGAAATGTTTGAAGGAATACAAACAGGCTTATAATAAATTCGTTTGCAATGTGTGCGGTATAACGCGGGTTTCCGCCGTGGGGTTCTTCGCCCACGTGTTGCAGTGTGGGAAAACTGAAGAGGAAATCGATAAACATAGAGGCGTCTGCGACATATGCAATAATAAATATTTACTGATATATAAGAATCAACATATATCGATGCACAGAGATCAGGAGTACGCGAAACAGAGAAAGCTGGAGCTACAAGTGGAGAAGGAGGAGAAACAAAAACAACAGAAACACTCTGACGCGTTACCAGAGAAGAGACAAGCTGCTGAGAGAGCCCGTCATGTTATTGAAAAGTATAAAAAGCAATTTAAGCATAACTGTCCCACTTGCGACTTTGGTGGCGATTCCGAAGAAGATTTGAAGAAGCACACGTGCTCGAAGACGAAGTATAACTTTAGCGAGTCCGAAGATTCTCTGCAATTCAGTTCGGAACAAGAATCTGAGGACAGCGACGCTAACTGTGAACTGTTAGAAGAAGAACTAGAAAAGCCTCAGAAGAATAATACTAAGAAAAAAAAACATTCTGACTCAAATCCTGCTACTGCATTCTTACCGTTTCCCGTCAAAAACACGCAGACGTATTTAGCTGAGAGTGCAGAGGACTTTCGTGAGAAGTTCTTAACGAGCGACATACTGTACCCACAGTGGAGGACATGCGAGTACGAGGTCGTGTCAGATGACCTGCTGACAAATTACTTGCCGACCTTGGAGGAATCGTGCAAATTACAGTTACAGAAAGACGAATGGATAGCGTTGAAGAAGTTTGAGTCTGTTAATGATCACAAATGGGTGAGTGCATCTTTCACGGGCGGTTGTATCCAGTGCGTGTCGTGGTGTCCGCCGCACGTGTCGGACGCGGAGGACGAGCTGGGTCACGTGTTGAGCGCGGCCGTGCACGTGTCCCGGGACGCGCCACGCCTCCCCGCCGACACGTGTCACACACACCACGCCATGCTGCAGATATGGGACTACGGGGACATGCGCACGAAGCCAAAATTTGCTCTGGGTATAGCTCTTGATTTTGGGACAATTTGGGCGAAAGATTGGTGTCCATCGGGCACGCGTGACATGTTGAACGGAGAGCCGACAACTTTTAAAAGACTTGGTCTTTTATCTATAGCATGCTCAAACGGTTCAGCGTACATATTATCAGTACCGTATCCTTCAAGCATAACGGACGGGGGGAAAAAGATTTTCAACCTAAAGCCAGTCGCGGAGCTGAGATTGACTCGTGGTGATCGGCGGAAGTATCAAGCTACAGCTATCAATTGGCCAGCGCAAAAAGGGCATTCCACTATAGTAGTCGGATATTCTGATGGAACAACCGCCTCGTATAATCTGTCGTGCGATTCTCCTCTCTTGACCGAAACAGAGGACGGCGTTAAGATATTCTATCCTTATCAAGACGAACGAACACACAACACATGTGTCACCGCGGTGACGTCATTTCCTAGTAGCGGCGTGTCGTGCCCGGCGGGTTCTTCGTCGGCTACAGGCGGCTCGCGGTCCGTGTGTCGCGGAGTCGGCCGCGGCTCTCGCTCCGCAGTCACCGCTACCTCGGCCTGTTTCATGCCGCACTGGCCCGACCTACTGTTGGCTGGGAACGACGCTATCGTATATCAAGCTCCGAACGTGTTGTCGTGGGTGGGGAACGGGCGACGCCTGGGCTCGCAGCAGGCGTGTGCTGGATGCAACACCTGCGGACGGGTGGCGCTAGTGGCGCCGCCCGCGGTGCGACTCGTCACTACACACCCCGTGCATAACGACCTTAATAAAATTACAGTGGCGCTGTTACAAATGAAACCGCTCGTGGATAAGAAATCCAAGCAGAAGAATGACGACCTCGCCACGAGACTTGAGCCGGTGACTTATGAAGACGCCGTCAAGAAATATGGAGTAGAGTTGAAACTAGCTGAAGATTGTGACAAGAGCTACCTCCAACAGTCAAACAAGCCGAGAGACCACTACCCAGAGAGGTTCCCGCTCTCAGACGTGCCAGCTATGGCCTTCAACCTGTCTCCGAAGCAACACAAGAAACTGGCGATCGCCACACATTCTGGATTTATTTTCGTTTTGACTGTATGA

Protein sequence:

>DPOGS214227-PA
MESTKLLQNDPPEVMVLPPNILEKLGIDLGNIQPCSNGTNAKSGTRQPEDISVLDINAPNYVPGKNNDEEFDGSSILETANLMKASVFLSPTFTNLKATNGLNDLEFNNFENAIATIECNVSKPLKSNSPKNNRVVNITLENKSNSSNGVQKNNARGKLQKSNNKINIISEEIIELEKIKDLITSEILVSNNLASVPINTIITEKTINIKNGAASVPSKEKNNNDCKDNNDKPLNDCKAVENKVANINNGIQDQLDKSIQFQIDSDGLKLSKEINLINNTNGDIIMYKNHEEKCSDENQDLVRNCDKDLKITSEQQDEHATSSSEEIVTIIHVINENVDEADVSKDLNFSIPDSVCHFTVTDTIDIVVPEHSELSINYDQENKSCDVKTSADEKILGDRKDAKINNKNLEISTRNKIINVNQNKVKSKLCIQGVEKPGGDDDNVNKDFGYFFYYDKDDDYQHIHYWEQKGAECSLDTLLSIYEDNKVYDVNINEPEYNEIETVKEDDQEQNSTDVCNICWMSVNSDIENEDLVEINKEAETRQITQSIEPQVKNTDEGKRNKPLHSQDVNRELQDKRKNSLSLNDVSPEETKKKKMEDVIENRNISCGLCKAKVVTSEWDNHVRDHCFIAWEEGQKFDFNNDLLCDRLKQHLNDNGKLVCVICDKVIKRVKKFLSHVEGCILYGHVEQSDDLKNVMSRQSKHTVCGVCKTQVPKSSWIEHIAKEHNYIAWKDGDKVLDVGDEEAVKRHLNKLASDVKRFECKSCGTKRKCSDSFFKHIQKCGKLGEGMDTADNSVGAEDDTSVTCGVCQNKMSANEWQNHQFKEHKYLAWKAGEQELDLNDTEHVYSHLYNLSKDLGGLLCSKCGCRRKYVNSYLKHIEKCDGEENPNCTLDSTMNESNVSINKTLGDDVTGDLEGIVRCGVCSKEIDRKQWENHIKKEHFYKAWQEGQKPVNLDNEEDVYNHLYAMSKKYKGLVCNNCGTNRKYVKTFLQHIESCNSQDSFITDEVLKQETCKCGVCGEEVPSKMWKTHAMKTHYNVAWLDQQTPIDTNNGTAVEKCLKEYKQAYNKFVCNVCGITRVSAVGFFAHVLQCGKTEEEIDKHRGVCDICNNKYLLIYKNQHISMHRDQEYAKQRKLELQVEKEEKQKQQKHSDALPEKRQAAERARHVIEKYKKQFKHNCPTCDFGGDSEEDLKKHTCSKTKYNFSESEDSLQFSSEQESEDSDANCELLEEELEKPQKNNTKKKKHSDSNPATAFLPFPVKNTQTYLAESAEDFREKFLTSDILYPQWRTCEYEVVSDDLLTNYLPTLEESCKLQLQKDEWIALKKFESVNDHKWVSASFTGGCIQCVSWCPPHVSDAEDELGHVLSAAVHVSRDAPRLPADTCHTHHAMLQIWDYGDMRTKPKFALGIALDFGTIWAKDWCPSGTRDMLNGEPTTFKRLGLLSIACSNGSAYILSVPYPSSITDGGKKIFNLKPVAELRLTRGDRRKYQATAINWPAQKGHSTIVVGYSDGTTASYNLSCDSPLLTETEDGVKIFYPYQDERTHNTCVTAVTSFPSSGVSCPAGSSSATGGSRSVCRGVGRGSRSAVTATSACFMPHWPDLLLAGNDAIVYQAPNVLSWVGNGRRLGSQQACAGCNTCGRVALVAPPAVRLVTTHPVHNDLNKITVALLQMKPLVDKKSKQKNDDLATRLEPVTYEDAVKKYGVELKLAEDCDKSYLQQSNKPRDHYPERFPLSDVPAMAFNLSPKQHKKLAIATHSGFIFVLTV-