Monarch geneset OGS2.0

DPOGS209763
Transcript: DPOGS209763-TA, 3462 bp
Protein: DPOGS209763-PA, 1153 aa
Genomic position: DPSCF300314 + 67611-75905
RNAseq coverage: 1x (Rank: top 94%)
Annotation
Heliconius: HMEL013594, 8e-09, 32.03%
Bombyx: BGIBMGA010171-TA, 7e-06, 22.18%
Drosophila: CG5931-PA, 1e-13, 25.91%
EBI UniRef50: UniRef50_D6WVU9, 1e-65, 44.09%, Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WVU9_TRICA
NCBI RefSeq: XP_970333.2, 3e-66, 44.09%, PREDICTED: similar to HFM1 protein [Tribolium castaneum]
NCBI nr blastp: gi|270011790, 4e-65, 44.09%, hypothetical protein TcasGA2_TC005866 [Tribolium castaneum]
NCBI nr blastx: gi|270011790, 4e-62, 44.09%, hypothetical protein TcasGA2_TC005866 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain: [112-281] IPR004179, 7.8e-31, Sec63 domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209763-TA
ATGCGATATAAACTTTCCTGTAAAATTTCAAGATACCAGTCCCTGGTAGGTGGTTGTGAGCCCTTACAAAGCTATCTTCACAAACGTCTGGCTGAGAACATCAATAGTGAGGTAGCTTTGGGTACTATCCGTGATGTAGCTCAATGCGTACAATGGCTTAACTCCACGTTCTTGAGAGTTAGAGCTGTTAAGGATCCTAAAAGATATCTTGGTTTACCACAGACAGCGACCGAGCAGCTCATATCGAAGAAAATAGAAGAGTTGTGTATTAGAGCAATGAACGGTCTTCAAAGTTCAGGACTTATAACTATGGACGAACTGGCGTGGATAGAGTCGACCGAGGCTGGCCGTCTCATGTCCATGCACTATCTAGACTTGGAGACTATGAAGCTCATTATGAAGATAGACGGCGATGCGTCGCTGGATCGTCTTTTATGGTTGATATGCGAGAGTCATGAACTATCAGATATGTATCTCCGTACAGATGAAAGAAGATGCCTCAACACACTCAACAGGAACAACAATTCCTCTACTATTAGGTACCAAATGAAGGGCAAGATTACTACACGAGAGATGAAGTTAAATTGTATAATCCAGGCTGTACTCGGCTGTCTTCCAATACCCGATCCCTCTTTAAACCAGGAAGCCTTAAAAATAATGCGAATAGCAGACAGAGTTTGCAAATGTCTCGTCTCGTACATAACACGTCCCAATTTTATATCTGATAAACCGAAATTCTATTCGGCGATTTTAAATTCTATAACATTAGCTAAATGCATCACGGCACATTTGTGGGAGAATTCTTTGTACGTGAGCCGTCAGTTGAAAGGAATTGGACCCACTTTTAGTACGTTGCTCGTGACGGCCGTTGGAATCGACAGACAATGCGAATATTTCTTTCAAGATATCAATCCATTTAGCAATAAAGATATATCTCATAACAAAATTCCCCAAACGATAAGTCCTAAAATACAAACTCATACAGATAAGGAGGGTTGCGATCGCAAAAGAAAAATCAATAATAAAAAATCAATACCATATGTAGAAAAGAAGAAAAGGGAGTCAATAGAAGATTTTTTAACAAAGAAGGAACCGTTTGAAAGAAATACTAAATGTTTGATAGGAAATTCAAACATATCGATGGAAAACAACCAAAAGATTCCAGAAAAATCTGAATCACTTGAAGTAAAATGTGATATCATTTCAAATAACCAGAAAGCAGTTGATAATGAATTTGCATATCAAATTGATGAAGACGATTTCCTTGAGGAATTCGATTACGAAGAAGTGGATGACGAAAAAATAGATGAGTTGTTAAACGGCATAGAAAATGAAGTAAACCGTCCCTCAACATCAGCATCGAGCCATATGTATATACAAAAAACGGAACAAGACCATTATACTAACAGTTCAAAGAGACTTAAAACAAACGATTACAAAGAACCTAAAAATAAAAGTAATATGAAAAATAATTTCACATTCATCGATAGAATCGAAAAAAATATAGAAAATGATAACGATAATATGACAAATAAAACAACTGGTCTTTCGGAAGCAGTAAAAAATCACATTCAACAATACTTACGTGAAACCAAAAGTAATATGAAATGCATAGGGACTAATTTAAGTGAATTAAACAAGAACTTAAATACCGACACACCGAATTTTACAAGTGAAGAAAAGAACGAAGTGAATGACAATGATAATTTCAGACCAGAAATTATAGATTTAACTGTGTCAGAAACAGAAAAATATAATTGTCCAACAAAAAATTACAACGCGGAGGAAATCCGTTCTAAGGCTCTGGGTTCTGAAACTGTCGAAAACACAGACGGGGATATAACACATGAAGTAGCTGAGATATTTCTAGAACACAATGAAATAGATTCGATCGAAATTGAAAATGACAGAGCGGGTACCGAAATTATAGACAAAGAAATGTTTAACATTAAAGAAAAACCTCAAAACGAATGTCTAAGGCTCAGCCATTTTAACGAGAA
CCCTAAAAAAGATATTAAAACTACTGACATACAACTGACATTAAGAACGTCTCCAGACACTCAGAAAAGGTCCACACAAAATTCCAATGTTATCTGTATCGGTAGTGACTTTGAAACGAGACTATACGATCAAGATTCTGTAAAAAACCCTGAAGCAATTCCAGAAACTTGCTCACAAAAACCATTAAATACTTATAGAACACAAGAAAACGAATCGAGAACTAGCTTGCATCAGGATTTCCAGTCTCCTTTAAATCCCACTAAATATAATTTGGACATTAATGACCGAAATATGAAAAACGAAGGATCATTCAAAGACATAAGAAAATATTCATACAATGAAGTAAAAACTGTTAGTTTTCATAAAGAGAATATAACTAGAATTTGTACTTTAAATGTTGCTTTAGATGTAACAGATATAGTCCACACAAACAACGAACCTATAATTTCTGAATATAATTTTGATGCCAAAAATGAAAGCACAAGTTATTATAAGAAAGACTCAAATACGCCCACAGACCTTAAAACTGACGGTAGTAAAACTTTTATTAAGAGTGATTACTCATATGAATTTAATACGAAACATCAAATAAGAGAGGTGAAAAAAACTGATAAAATTGCTCAAGAAAAAGCTCAAAATACTAATCAGATAATTCAAGGAGAAGATAAAAAAACTACAATTAGAGACATCCTAATGAAGTATAATAAAAATGATCCAATTCAAAGTTGTTCAAAAAACGTTGAAACAGAATCTCTGCTTGGAACATACCAAATAGAGCGTTCAGAATTAAAACCCAGAAGAAAATATAGAATATCCGACTTAGAAAAAATAGATGTTACACTACCGGCATCGTTGATACCAAGTCAAAATACAAACCACACATTAAGTTTTGAAACAAAAACCGAACCAAATCTACATTTTACAAACCTTGAGGAAATACAAGATGAAATTATAACTAATAAAACAAACGATGAACATAACGACAGTTCAACTGATACAAATGAAGATTGCAACAAACTCGACCTAGAAGAAACAGATTTAAATTTAGAATGCGAGTTAACTGTCAACGACTCTTTTAATTTCGACGAAGCTATACTAGACCCTAAAACACTTCTACAGTCGAACTCAAACAATGAAGTTATAATTCCACCACCGGCTGAATTTTGCGATAATTCACCTGACGCGTCTCCACAAATAAATTTTCCTAACAACGATTCTGTATATGATCCACAACATTCAGCTTTAGACGATAATAACGAAGTCTCTCCTACAAGCTTCAATGATGAAATAAAAGATATTTTTACCTCTAATTTCTCACAATACAGTAATTTTGCTGTACACAATCAAAGCTCTTCTGGTATTGTCAGTCCATCCGTGAACAATAGAAACGATCTAGTAAAATCTCGTTCGTATAAATTATCACAATTTAAATTTAACCGCAAGAGGATGTTTAAGAAGTAG

Protein sequence:

>DPOGS209763-PA
MRYKLSCKISRYQSLVGGCEPLQSYLHKRLAENINSEVALGTIRDVAQCVQWLNSTFLRVRAVKDPKRYLGLPQTATEQLISKKIEELCIRAMNGLQSSGLITMDELAWIESTEAGRLMSMHYLDLETMKLIMKIDGDASLDRLLWLICESHELSDMYLRTDERRCLNTLNRNNNSSTIRYQMKGKITTREMKLNCIIQAVLGCLPIPDPSLNQEALKIMRIADRVCKCLVSYITRPNFISDKPKFYSAILNSITLAKCITAHLWENSLYVSRQLKGIGPTFSTLLVTAVGIDRQCEYFFQDINPFSNKDISHNKIPQTISPKIQTHTDKEGCDRKRKINNKKSIPYVEKKKRESIEDFLTKKEPFERNTKCLIGNSNISMENNQKIPEKSESLEVKCDIISNNQKAVDNEFAYQIDEDDFLEEFDYEEVDDEKIDELLNGIENEVNRPSTSASSHMYIQKTEQDHYTNSSKRLKTNDYKEPKNKSNMKNNFTFIDRIEKNIENDNDNMTNKTTGLSEAVKNHIQQYLRETKSNMKCIGTNLSELNKNLNTDTPNFTSEEKNEVNDNDNFRPEIIDLTVSETEKYNCPTKNYNAEEIRSKALGSETVENTDGDITHEVAEIFLEHNEIDSIEIENDRAGTEIIDKEMFNIKEKPQNECLRLSHFNENPKKDIKTTDIQLTLRTSPDTQKRSTQNSNVICIGSDFETRLYDQDSVKNPEAIPETCSQKPLNTYRTQENESRTSLHQDFQSPLNPTKYNLDINDRNMKNEGSFKDIRKYSYNEVKTVSFHKENITRICTLNVALDVTDIVHTNNEPIISEYNFDAKNESTSYYKKDSNTPTDLKTDGSKTFIKSDYSYEFNTKHQIREVKKTDKIAQEKAQNTNQIIQGEDKKTTIRDILMKYNKNDPIQSCSKNVETESLLGTYQIERSELKPRRKYRISDLEKIDVTLPASLIPSQNTNHTLSFETKTEPNLHFTNLEEIQDEIITNKTNDEHNDSSTDTNEDCNKLDLEETDLNLECELTVNDSFNFDEAILDPKTLLQSNSNNEVIIPPPAEFCDNSPDASPQINFPNNDSVYDPQHSALDDNNEVSPTSFNDEIKDIFTSNFSQYSNFAVHNQSSSGIVSPSVNNRNDLVKSRSYKLSQFKFNRKRMFKK-
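
The lengths listed in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database: the helper names are hypothetical, and it assumes that the transcript is entirely coding sequence ending in a single stop codon (shown as the trailing "-" in the protein sequence) and that the genomic coordinates are 1-based and inclusive.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS of cds_bp nucleotides.

    Assumption: the CDS ends with one stop codon, which is not
    counted as an amino acid.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 3462
protein_aa = 1153

print(expected_protein_length(transcript_bp) == protein_aa)  # True
print(span_length(67611, 75905))  # 8295
```

Under these assumptions, 3462 bp gives 1154 codons, which is 1153 amino acids plus the stop codon, matching the listed protein length. The gene would span 8295 bp on the scaffold, longer than the transcript, which would be consistent with the presence of introns.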