Monarch geneset OGS2.0

DPOGS200289
TranscriptDPOGS200289-TA3843 bp
ProteinDPOGS200289-PA1280 aa
Genomic positionDPSCF300026 - 538993-552217
RNAseq coverage101x (Rank: top 61%)
Annotation
HeliconiusHMEL0000380.070.76% 
BombyxBGIBMGA005562-TA0.068.36% 
Drosophilal(2)k05819-PA1e-17945.14% 
EBI UniRef50UniRef50_D0AB860.072.71%Putative lethal (2) k05819 CG3054 n=1 Tax=Heliconius melpomene RepID=D0AB86_9NEOP
NCBI RefSeqXP_396078.30.042.44%PREDICTED: similar to lethal (2) k05819 CG3054-PA, isoform A isoform 1 [Apis mellifera]
NCBI nr blastpgi|2613359480.072.71%putative lethal (2) k05819 CG3054 [Heliconius melpomene]
NCBI nr blastxgi|2613359480.072.71%putative lethal (2) k05819 CG3054 [Heliconius melpomene]
Group
KEGG pathway 
Orthology groupMCL11786 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200289-TA
ATGGATGAGAAATGTCAACAAGCAATAGGATATTCCACTAAAACAGCTCTCTTAAAATTAAGAGATGATATTCGGAATGTCATTGATGAATATAAACATAAAAATCGCATGAGGCTCGGGATTATACGCAAGATAATATCATTTAATTCAGAAAAAGTATTGTTATCTGGAGTTGCCTACACAGTCACCATAATTAATATTGTATGTTTGCTGTTAACTTATCTAATAGGAGAGACTGTCCTTCCTCAAGGATACTTACTTTGGGAAGCCTTATTTTTATTCATTATTTTGATAGTCAACTTTTTGGTTGCACTTAATGAAGAATACTTGTATCAAAATGAAATACCACATAGAGTGAAGAAGGTTTTAGAAACACTAGACTCAGCGATTGAGAAGTGTTATTGGAAGGAGATCCATTATCCCCATCTATGTGCACCATTTTCACCATGTGTTATACTGCAATGGACTTACAGAGATGGTACTATTGTAAATTTACCATGGGCTTTATTAGTGGAAGGTGATATCATAGTTCTTCGACCTGGACAGGAGGTCCCGGGTCATGTCCAAGGACTAAGCCCAGGTGACCCGGAGTTATTTTTTGGTCAAATTTTTCAATCTGACTCAAAGTTTACTAAAGAGAACTGCAGTGCTCCTAGAGTGTGTACCCCTCACCCTAACAAAGAGTACAGAATGTGTGAGACGCCCTACCTAAAGAACCTCAGACTGGCACTCGATGAAGCTACCAACAGACCCGTCACAGTCTATGAAAAACAAAGGTACTTGTGTACAGTTAAAATTATGGAAGGCATCGTAATGCCTGTGGTTATCATACTTATGGTGGTTGGTTGTATCATTAGACATCTATATGATCTGCCGGGCACCACACATTGGACGGAAGTGTACTTACTACAGACGATTGCTGCTTCTCTGCCGCTGCTGCAAATAATATTCCCTATTATATGGGTCACACTTAATTGTTACGGACTCGCCACCTTCAAACATCTATCTATGACTCGCCCGAAGTACTCTAGACCTAAGTCCTGCTCTATATCGGAAGACGCAAGCGATCCACTCATACAAGTATTGAGCGATCCGAACCTAAGACCTAAGAGCTTCCATGCTTTGGCGAAAACCTTCTGGAATATTCTCATTGGCAAGGAGGACATGGTCACACGAACTTCAAACATAGTCCATGTCTTAGGATCGCTTTCGGCTCTATGCTGCGTAGACAAGAAAGGCATACTGTCATGGCCGAATCCAACAGCAGAAAAAGTATTCTTCCTTCGTAACTCCAGTCCCACGTCGCAGAACTCCAGCAAGACCAGTCTAGTGGGGTCTATGGGACATCACAGCGAAACTACCTTGGACCAGACTGGAGACAGCAAACAGAACACCGAACCGTCCACCGTTGTTGAGGTCCTGGACTTGACCCACGACCAGAACGCGCCGTTCCGTGTAGAGTTCGACGAACAGCGTTGGCGGACACACCTCGCTCGCCTCAAGCCTCTGGGTCTTGCAGCTCTCCTCAACACCTGTGCCGCACAACCGACGCACACATACGCACACTTCTGTGCACATCTCACATGCGAGGCACAGCACAGTGAGGATTTAGTTCCAGTCACCAACAGAAGATGTCTCTGTGAGTTGGCGAAGCAAATAGGATTCACGGACTCGGTCACTGACTCCTACAGTATACAGGAGTGTTTGGCCGTTTTTAGACATTTGCAAGCGGATACAATAAGACGTGAAACTCGATTCGTTCGTTCCCTCCACCTCAGTACACGTGTGAAGGTTCCTCTGCCACACATGCTGGCGGTCATCGTTAAAGATCACGCGAATCAACTACAGATGCTGTCACAGGGTACGGCTGATATAATATTGGATTCGTGCGTAGACTTCTGGAACGGTCGCGACCTGACGACCATAACGCCCTCCGATAGGAAGAAGATAATAGATTTCTACCAACGGAACTCTCTCACGGCGTACTGCACCGCATTTGCTTATAAACCGCTCACCCGCGGCGTGGCTCCGTGCGTGTCCCGTCAGTACCTGGAGCTGCCCGCTGACAGCCGGCCGTTGTACGCTCACTGCCAGCTGTGGCCGGACGCGCCCGCGTTACACTTCCACTCTACAGACTCTTTGTTGTTCAACGAAGTGACGGATGACGACGTCGATGATGCTGACGGATATTTTGACATGCAATGTAATCAGGTTTTCATTGGTATGGTGACGATGCAGTACCAGGCTCAGAGCGATATGGTGGAACTGATAGAGCGTTTGGAGCGAGCCTGCATACGATTTGTGCACTTCAGTAAGGAGAACGAGCTCAGATCGAGGGTTTTCAGCGAGAAGATGGGCCTAGAGTCTGGTTGGAACTGCCATATATCACTTATGAATGATCAGAAACATAGCGCGGCCGGGTCTCCGCTATGCTCTGAGACACAGCGCTCCAACGCCCACGCTAGGAACGAACTCACTGACGCCTTAATATATAGCGGGAATATGGCGGCGTCGAAGTCTTTGTCGATGTCCGCGCCCGGGGCCATCAACGTTGAACACAGTACAGTTAAATTTGACGACGAAATAAAAAAAACACGTCATTCTATAACAGCACACAGCGGACTAAAGGAGGTTCAGCTGAGCAACGCCAGCCAGGCGTCGACGGACTCCCTGCTGGGGTCTCCGGCGGACGGCTGCCGGTCCCTCTCCTGCCTCACAGACTCCACCGACCACAGCGCGCCGCTCAACTTCGACATGAGCAACCGGGTCTGTACCGGCCGACGTGACGCTTTGACCCCACCGGACATTAATTCCATCATTGTTTTTGTACCCTTATTGTTTTTAATATTTAATAACATTTATCTTCAGGATTACGGTGAAGTAGTTTGCGTAATGGGTTCAGCGGCCAACTGTCAAAATATGGAGATTTTCATGCAGGCGGATGCTAGTATCGCTCTAGAACCGCTGTATCCTGTTCTGTGCCAAAAAATGCCTCCCTATGAAGTCCCAGACGACTGCATCGGACCCATCGACCTGGCGAACGTACTGAACTCTGTGCCGTGCTCGCTGTCGATGTCGCGCGAGGCGGACGTGTCGCTGTTCGCTCTCATCGCGATGTCGCGTCACTTCATGATGTGTCTATGGAACAGCACGCAGTTCTCTATTTGCAGTTCCTGCTTCATATCTTTTATACAGGTGTGTTCCGCGCTAGTGTGTCTCCCCGGTTGCGTGTCGGCTGGTGGTTGCCTATGGTCGGTGTGTGCGTCGGCGCTCCTGGCCGCATCATTATCTGGAGCCCCGGCCTCCCTCGCCGCGCCTCTCATGCTGCGAGCCGCCACTAGACCTGCTCTCGTACTAAACGCCAGAACGGCGCTGTTCGTTTTCTGGTTCTACGCGTGCAAGTTCTTACCCGCCGCTGTGTCTATATTGATGTTACACGCTTTGACATTGAGGAGTTTCTGCGACCAGATAGCCGATCACACCCAGACGACGTCCTGTTGGATCGTGTACCCGGTTAACATCGGAAACTATACGGCCGACCGCGACCTGTCTGAGAAGAGCTGGCAGGGGTGGGGGGACGACTTCTACGACGGGATCCTGACACAGTGTGTGTTCAGCGGCGCGCTGTTGCGAGCGTCGTATGTAGACTGCGAGGCGTGGTCGATGTGTTCGTTCCGCTACCACGTGCCGTGGCCGTTACTCGTCGGTTACGGAACCTCACTGGTTTTTGTATTCGGCCTCAACGAACTCATTAAATGGCAGGAGATGAAAGTCGACGCGCGAAACCAGAGACGCGCCAGATTGGATTTTGGAACAAAACTGGGGATGAATTCTCCATTTTAA

Protein sequence:

>DPOGS200289-PA
MDEKCQQAIGYSTKTALLKLRDDIRNVIDEYKHKNRMRLGIIRKIISFNSEKVLLSGVAYTVTIINIVCLLLTYLIGETVLPQGYLLWEALFLFIILIVNFLVALNEEYLYQNEIPHRVKKVLETLDSAIEKCYWKEIHYPHLCAPFSPCVILQWTYRDGTIVNLPWALLVEGDIIVLRPGQEVPGHVQGLSPGDPELFFGQIFQSDSKFTKENCSAPRVCTPHPNKEYRMCETPYLKNLRLALDEATNRPVTVYEKQRYLCTVKIMEGIVMPVVIILMVVGCIIRHLYDLPGTTHWTEVYLLQTIAASLPLLQIIFPIIWVTLNCYGLATFKHLSMTRPKYSRPKSCSISEDASDPLIQVLSDPNLRPKSFHALAKTFWNILIGKEDMVTRTSNIVHVLGSLSALCCVDKKGILSWPNPTAEKVFFLRNSSPTSQNSSKTSLVGSMGHHSETTLDQTGDSKQNTEPSTVVEVLDLTHDQNAPFRVEFDEQRWRTHLARLKPLGLAALLNTCAAQPTHTYAHFCAHLTCEAQHSEDLVPVTNRRCLCELAKQIGFTDSVTDSYSIQECLAVFRHLQADTIRRETRFVRSLHLSTRVKVPLPHMLAVIVKDHANQLQMLSQGTADIILDSCVDFWNGRDLTTITPSDRKKIIDFYQRNSLTAYCTAFAYKPLTRGVAPCVSRQYLELPADSRPLYAHCQLWPDAPALHFHSTDSLLFNEVTDDDVDDADGYFDMQCNQVFIGMVTMQYQAQSDMVELIERLERACIRFVHFSKENELRSRVFSEKMGLESGWNCHISLMNDQKHSAAGSPLCSETQRSNAHARNELTDALIYSGNMAASKSLSMSAPGAINVEHSTVKFDDEIKKTRHSITAHSGLKEVQLSNASQASTDSLLGSPADGCRSLSCLTDSTDHSAPLNFDMSNRVCTGRRDALTPPDINSIIVFVPLLFLIFNNIYLQDYGEVVCVMGSAANCQNMEIFMQADASIALEPLYPVLCQKMPPYEVPDDCIGPIDLANVLNSVPCSLSMSREADVSLFALIAMSRHFMMCLWNSTQFSICSSCFISFIQVCSALVCLPGCVSAGGCLWSVCASALLAASLSGAPASLAAPLMLRAATRPALVLNARTALFVFWFYACKFLPAAVSILMLHALTLRSFCDQIADHTQTTSCWIVYPVNIGNYTADRDLSEKSWQGWGDDFYDGILTQCVFSGALLRASYVDCEAWSMCSFRYHVPWPLLVGYGTSLVFVFGLNELIKWQEMKVDARNQRRARLDFGTKLGMNSPF-