Monarch geneset OGS2.0

DPOGS211263
TranscriptDPOGS211263-TA3909 bp
ProteinDPOGS211263-PA1302 aa
Genomic positionDPSCF300506 - 10797-25074
RNAseq coverage94x (Rank: top 62%)
Annotation
HeliconiusHMEL0095574e-2629.41% 
BombyxBGIBMGA001611-TA3e-1142.27% 
Drosophila% 
EBI UniRef50%
NCBI RefSeq%
NCBI nr blastp%
NCBI nr blastxgi|1568445261e-5926.90%hypothetical protein Kpol_1058p4 [Vanderwaltozyma polyspora DSM 70294]
Group
KEGG pathway 
Orthology groupMCL10526 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211263-TA
ATGTTTATAACGTATTTAGTGCTAGCTGCGACGCTTCATGTTTGCACGTCGGCCGCGGAGCCTGTAGCCAACCGCCCGGCCCACACCGTCACTGTTTTACAACGCAGAAGATATCAAGGTTCGGAACAGATAGCGTCAGAAGAATATAACACAGAAGAATACCACAGCAAAGAAGACGAAGAAGAATGGGAATCGAAAGAACAGAAATATTCTTTTGAGAACGCCTCCGACGAAATCGTCGAGGCTAAATCATTAGGGTCATATCATAACGAGACTAACCGCGATAGTAACCTTGTCGATAACTTGCCTGGCGTGATTAACGATTCTGTAGATAAAATATACCGTTATACAAAACCCATGGAGGCGGCGGCGGACCTGACAGATGAGACGATGGTTGTAGCTTACTTCAAGACAACGGCCGGCTTGAAAACTGACGAAGAAGCGTACGAGTTTCTGAAGACATGCAGAGGAGCCAGGGGTATGAGGTTCACAGTGAAACTGCTCAAGAAACTAACGGGCACTGCGGATCTGAAGGTTTACGTTCATCTCATCAAGATAATAACCGGGGAAGACGATGCCGTACAGGGTCTAAGGGTGATACAGAAACTGACAGGACATTATGACATTGACGGAATTATTGAGATTCTTCAGCGGATATCCGGCAGAATAAGCATGTCGGACCTCATTGAATTCTATGAACACAACGGAGATCACACGACGGGTCTGGAGGGTATCAGTCACCAAATTTACAAGCTGACGGGAGACGCCGGGCCCAAGGAGTTCCCGGGGGTCTTTGAGGCCTTGAAGTCAGAAGTTGACTGGACGGCTACCTTCAACGTGCTTGTGACTGTGATGAGATGCAACGACATCAGCGAGTCCATCGGTGATCTGAAGAAGCTGCTCGGCGTGCAGACCTTCCCGGAGGCGATCGACGCCATGAAGCTGGCGACGGAACAGATCCATTTCAGATACGCGCTGGAATCATTCCTGGAGGAAACTGGGAGAGTTAATCTGAAAGTCATCATTACGACTCTGTATCAGCTGACAGGAAAAACGGATCTGCTATCTGTTCTCCGAGAGCTACAGAGACTGTACAAAATTAATAATATAATAAAGGTATTTCAGACAGTGAATCGCATTTCCAAGAAGAGAGATCCGCTGGTTGTGCTGAACAACCTGGTGGACATCACGTCCGCAGCCAACCTGAGCGACTGCAGCTCCGTCTTCACGGGACTCACCTTCAAACAGATGACGGCGCTGGAGGTGATCGAAAAGATTCAACTGGAGACAGGATGCACCGACCTCGTGGACTTCCTGAAGGGGATGCTCTCCGTCACAGGAAGCAGAGGTCTCCTGAAGTCCTGGCAAGTGATCGTGACCGTGACAGAAAAGAACGATATATTTGTTTTGATAAGTACACTGAAGCAGTACACATCAGCGGACCTGGTCACCGTGTTTCGTTTCCTACAACGGATTACAAACACCATCAATATCGTTCAAGCCGTCGACAAGATCAATGATTTGCTGGAAGTCAAAGGTTTCTTTGAATGGACCGAGACACTTCATCACGCCGCTGACGCCGACGTGTTCGAGTTCTTGCAAGCCTTCACTAGCTTGAGTCCGCGGTCATTATTGGAAGAAGCCATTGAAACAACTATGCATTATACCGATGTATACGAACCAATACAGGCTATCGAAGATTTAAAAGCGGTTACCCAACAAGACGACATAATTAAAGCTATAAACTACATAAAGGCGAAGAAACCTAAGTTGACCGTAACAACAACAACAAAACAAGCGCAAATAACGACAAACCTAGAAACAAGCGCTGAAACAACAGAGAAGCTAATTACGGAACACGAAAATGCTTCCGACACATCAGAACACTTGGATCACTCTAAAGAACAGACCACGAATGAACAAAAGCCAGAAGAAACAGAACACAGCGTGGAAACCTCAACTAAACCAGCAGAATTAACGACTGTAGCATCTTTAGAGACAACTACAGCACCAACCACAGAAGCGGCTTCAGAAACCACTTCAGAGGCAACTTCGGAATCAACCCCAGAGGCAACTTCAGAATCAACCACAGAGGCAACTTCAGAATCAACCCCAGAGGCAACTTCAGAATCAACTCCAGAGGCAACTTCAGAAGCAACCACAGAGGCAACTTCAGAATCAACCCCAGAGGCAACTTCAGAAGCAACTCCGGAACCAACAGCAGAACCAACGGTACAGTCAACTACAGAATTAACTTTAGGCCCTACCACAGAGTCAAATATAGCACTGACTACCGAGGGAACTGCAGAGCCTACTACCGAAACTACAACAGAATCTAAAATTGACTTAGAGACAACAACAGCAACTCCCGAACCAACAGAACCGGCCACAGAAAGCGGGACCAAAACAGAAACGTCAAATAATCCCGAAAACAATCCACAACCAGCGAATGACAATGAGGGAAAACCAAATACTTCTGAACAAGAAAATAAGATACCAATTCAAAATGATACACAAACCATAGAAACGAAAGATGATGGACTTAAGGAACCAGAAGGGGACAATAAAACAACGCCTTTAGACACATCCAAGGATGGAGATATCCACCCATCAGCAAACGAACCAGCGACGCCGTCTGGTGAAGCTAGTGACACTAAACAATCGGACAAACAAGAAAATGTTCTAAACAATCAAACCGAACCCACAACAGAGGCTCCTGCTAATCAAGGTACAATCCCAAGTTCGGAAAAACCAATTGAGAGTAACAAACCGACAACGACTGAAGAGCAAAAGGAAAGTCCATCTACAGCCGATAAAGGTGACGCTAAGAACGAACCAGGACCTGCTGATGAACCGGTTCCTACAGATAAAACTCCGGCAGGCAATGACCAAAAGGTTTTGAATGATAGCCCGAACGCGCCTAGTTCAGACTCTCAAACACCCAAACCGCTTGATCAAGAACAAACAACAGCCGCTCCTGATACGACGGAAGCACAAATCACAGATGCGGGAACGACTGAGGATCCAAATAAAAATGTTGCCAATGAAACGACCTCACAACCCTCAACCACTGGCGAATCAAGTACAACAACAGAAGCAACTACTGTGACTAGTGTTCCGGAAAACTCCGGCGGAACTCCTGAAGCAGCTAATAATAATAGTGAGAAATCACCCAGCAACGAAAATCAAATTCTATCTACCGATAAACCCCAAGATGCAAACATAACCAGTTCACCAAGCGATGATACGTCCGCTAAAAGTCCTGAAGATTCCTCACCTGGCACATCAACCTTATCAGATGTTACTCAAGACCAGAAACCTGTTTCTGAAACAACAAGTGACAAACCTACAGGAATAACACCCACCAGTGTTAGCACGGGGAGTCTTGACAAACCTGGAAACGAAAACGAACCGACGACACCAGCCACGACAGGTGAAGCACAAAACACAACTGAGAGCCTGAAAAATGAAGAAGAATCACCAAGCGATAAGGAATCAGTGACCAGTACCACCCAAAATAGCGCAGACGATACCTCCGAGACTCCATCGAGCACCACAACAGACAAAACACCGAATGATGAGAGCTCTAGTCCCAAAATCGATTCTTTAGACCCAAAAGACAAACCTGAAGCCTCTGGACCAAAAGACGAAGTACAATCTACCGAGTCTCAGTCTACACCTGCGCCACCCAGCCTAAAACCTAACCAGGACCCAAGTACTTCTGGACCCCAGGATCTTTCAACAGACCCGCAAGCTCTCCAAGTGTCGATGAATCTGAAACTCCTCATCCTACCCAAGGGACCAAAAAACACAAAGAATTGCTACTGCGCCTGCGAGGGCCAGGAACCGCCCAAACTGGTCCCGATGCGACAACTTCCGGTCGGGATTACTAGTGTAGATAAAAATTAA

Protein sequence:

>DPOGS211263-PA
MFITYLVLAATLHVCTSAAEPVANRPAHTVTVLQRRRYQGSEQIASEEYNTEEYHSKEDEEEWESKEQKYSFENASDEIVEAKSLGSYHNETNRDSNLVDNLPGVINDSVDKIYRYTKPMEAAADLTDETMVVAYFKTTAGLKTDEEAYEFLKTCRGARGMRFTVKLLKKLTGTADLKVYVHLIKIITGEDDAVQGLRVIQKLTGHYDIDGIIEILQRISGRISMSDLIEFYEHNGDHTTGLEGISHQIYKLTGDAGPKEFPGVFEALKSEVDWTATFNVLVTVMRCNDISESIGDLKKLLGVQTFPEAIDAMKLATEQIHFRYALESFLEETGRVNLKVIITTLYQLTGKTDLLSVLRELQRLYKINNIIKVFQTVNRISKKRDPLVVLNNLVDITSAANLSDCSSVFTGLTFKQMTALEVIEKIQLETGCTDLVDFLKGMLSVTGSRGLLKSWQVIVTVTEKNDIFVLISTLKQYTSADLVTVFRFLQRITNTINIVQAVDKINDLLEVKGFFEWTETLHHAADADVFEFLQAFTSLSPRSLLEEAIETTMHYTDVYEPIQAIEDLKAVTQQDDIIKAINYIKAKKPKLTVTTTTKQAQITTNLETSAETTEKLITEHENASDTSEHLDHSKEQTTNEQKPEETEHSVETSTKPAELTTVASLETTTAPTTEAASETTSEATSESTPEATSESTTEATSESTPEATSESTPEATSEATTEATSESTPEATSEATPEPTAEPTVQSTTELTLGPTTESNIALTTEGTAEPTTETTTESKIDLETTTATPEPTEPATESGTKTETSNNPENNPQPANDNEGKPNTSEQENKIPIQNDTQTIETKDDGLKEPEGDNKTTPLDTSKDGDIHPSANEPATPSGEASDTKQSDKQENVLNNQTEPTTEAPANQGTIPSSEKPIESNKPTTTEEQKESPSTADKGDAKNEPGPADEPVPTDKTPAGNDQKVLNDSPNAPSSDSQTPKPLDQEQTTAAPDTTEAQITDAGTTEDPNKNVANETTSQPSTTGESSTTTEATTVTSVPENSGGTPEAANNNSEKSPSNENQILSTDKPQDANITSSPSDDTSAKSPEDSSPGTSTLSDVTQDQKPVSETTSDKPTGITPTSVSTGSLDKPGNENEPTTPATTGEAQNTTESLKNEEESPSDKESVTSTTQNSADDTSETPSSTTTDKTPNDESSSPKIDSLDPKDKPEASGPKDEVQSTESQSTPAPPSLKPNQDPSTSGPQDLSTDPQALQVSMNLKLLILPKGPKNTKNCYCACEGQEPPKLVPMRQLPVGITSVDKN-