Monarch geneset OGS2.0

DPOGS209184
Transcript: DPOGS209184-TA, 2409 bp
Protein: DPOGS209184-PA, 802 aa
Genomic position: DPSCF300061 + 242315-252115
RNAseq coverage: 135x (Rank: top 56%)
Annotation
Heliconius: HMEL009759 | 0.0 | 85.75%
Bombyx: BGIBMGA011536-TA | 3e-111 | 61.75%
Drosophila: CG8379-PB | 1e-117 | 53.90%
EBI UniRef50: UniRef50_UPI0000E464FA | 2e-170 | 42.80% | UPI0000E464FA related cluster n=1 Tax=unknown RepID=UPI0000E464FA
NCBI RefSeq: XP_001192297.1 | 3e-171 | 42.80% | PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
NCBI nr blastp: gi|242003182 | 2e-136 | 56.74% | conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx: gi|242003182 | 4e-130 | 56.28% | conserved hypothetical protein [Pediculus humanus corporis]
Group
KEGG pathway 
InterPro domain: [548-781] IPR013636 | 7.8e-69 | Domain of unknown function DUF1741
Orthology group: MCL12962, Single-copy universal gene

Nucleotide sequence:

>DPOGS209184-TA
ATGTCTATGCGGAAAAGGAGTGGATCTGGCTCAAAGAGACATCTAAAAGAAAAGGTTGTACAAATATATGAATCATTTTTCAAAGGAGATGATTTAACCAGTGACAATCCAACATTTTGGGATGAGTTCTTCCTCCTTAAGCCCAAAATTCCACAGTTAGAAGCAGAGATCCAGAAACTAACTGTGGATCAACTTAATAACTTGAAAGATAATATTAACCTTTTAGTTTCCCAATGTATAGAGATGCAGGGACAAGAACATCATATAAGATTAGTGTATGCTCTTCAGACTTTCAGTGTCATATTAACAACAATGTACCAAAGAGCTAGTCAAGATCCCAGTATAAACTTAAAACATGTGCTTCTGGGTGAAAATCCAAATATTAAAATGCAGAAACTTCTAGAACATTGTGGTTCAGTGTTATCAGGTGACGTTCCTGATAGTTTGAAATCGATGGTTTTAAAGTTACTGCTGATTATAGCGACGGGTTTTGATAATATCGATGACAACCCCCTTGTGGAATACCTGATGTCACATTCATTGTTTGATCCTCTCATACAACTTCTGTGCACATCGTCGGAAAGACAGCACCATGGTTACGACGTGGTGCTCCTGGTGATGTTTCTGGTGAATTATAAGAAGCAAGAGTCGGCTAACCCTTATGTTGTTAAACTGTCCATATTGGATGATGAATGTGCATTGAACGGTTACGGCCAAGTTATAACGGCGGCGTTGAACGACTTCGTGATGACCACGTTTGGTGGGATGCAAGCGAGTGGTGGCGGGTGGCTGTCCTCGCTCACCAGCATGGTCGGCGGGATATTCCTCACCACCGATGACACACAGCCAGTCGTTAGAGGACAGAGACAGAGTGGCACAGAGGAAGGTATGCTGCTAGGTCTGTTCGCTGCTGCTCATCTCAACCGGAACTTCATGACGACGTTGGCGCACTCCTCGGCAGCCTCCGCACCCCCCTCCCCTCCAGCCACGCTGCCTCCCAGACAGAGTCCACCCAATCTCGCTCAGATCCAAGCTCTCAATAACGACCAACCAACAAATTTACTAGTGACTTTCTTTCAGTACTGTTACGGCCAAGTTATAACGGCGGCGTTGAACGACTTCGTGATGACCACGTTTGGTGGGATGCAAGCGAGTGGTGGCGGGTGGCTGTCCTCGCTCACCAGCATGGTCGGTGGGATATTCCTCACCACCGATGACACACAGCCAGTCGTTAGAGGACAAAGACAGAGTGGCACAGAGGAAGGTATGCTGCTAGGTCTGTTCGCTGCTGCTCATCTCAACCGGAACTTCATGACGACGTTGGCGCACTCCTCGGCAGCCTCCGCACCCCCCTCCCCTCCAGCCACGCTGCCTCCCAGACAGAGTCCACCCAATCTCGCTCAGATCCAAGCTCTCAATAACGACCAACCAACAAATTTACTAGTGACTTTCTTTCAGTACTGTTCTATAGTGATGGCGGATACCCGCACGGAGACGAGCATCAACAAGTGCAGTCTCTGTTTCATCACCCTCACTTGTATTGTGGAGGAACAGTTCGCCAACTCAATAATGCATGATCAGAATCTGACTTTTAAGGTCCAACTGTATCGTCTACCGATGCGTCATCGAAAAATAGTTCCAGAAGAACCGCCCTCACAACCTCTGGCCTCCACACTCATAGATCTCCTCATCGAGTTCATCATGTGTCACCTCCTCAAGAAGTTCCCCGGTGACTTATATTCTCTCTGTGTGGGTGTCCTACTCCGCCTCCTGAGCTATCAGAAGCGATGTCGTGTCAGATTATCCCGGGATTGGCGCCCACTGTGGGCGGCGCTCATAGCCCTGCTGAAGTTCCTAGTCACCAACGAGAGTGTACTGCTCAGAAAACATAATATATTCATTATGGCACAACAGGTGGTGAATATTTTCAATCTGTTCATAACATTCGGAGACACTTTCCTGCCCACCCCAGCCTCCTATGACCAACTGTACTATGAACT
CATCAGGATGTACCTAGTATTTGATAACCTGTTCTTCATGGCTCTCCGCTACTCCACCGGTGACGGTGAGTTCAAAGCGGAAGCCCTCCGCCTGGCCAACTGTTTGGTCAACGTTAGAGCGATCGTCCAACACTTCTCACCCAAGATAGACGCCTGGCTGGCCTCGCAGCACCTCTCCACACCCACAGAGGACCAGATCCTGGAAGTGGTTCGTAAGAACTACGACAGTCTGATACTAAAGTTACAAGAGGGTCTGGAGGGCTACGAGCGATACAACGAGAAGACCCACAAGGCTCTGCTAAGCAAACTGGTCCGCGTAGCAGCGAGCTACGCGCGGGCTCGTAGCGACGCTGCTACGCTCACCCAACACTCGGCCGCGCTACTAGACCACTATACAGCGCTCGCCTGA

Protein sequence:

>DPOGS209184-PA
MSMRKRSGSGSKRHLKEKVVQIYESFFKGDDLTSDNPTFWDEFFLLKPKIPQLEAEIQKLTVDQLNNLKDNINLLVSQCIEMQGQEHHIRLVYALQTFSVILTTMYQRASQDPSINLKHVLLGENPNIKMQKLLEHCGSVLSGDVPDSLKSMVLKLLLIIATGFDNIDDNPLVEYLMSHSLFDPLIQLLCTSSERQHHGYDVVLLVMFLVNYKKQESANPYVVKLSILDDECALNGYGQVITAALNDFVMTTFGGMQASGGGWLSSLTSMVGGIFLTTDDTQPVVRGQRQSGTEEGMLLGLFAAAHLNRNFMTTLAHSSAASAPPSPPATLPPRQSPPNLAQIQALNNDQPTNLLVTFFQYCYGQVITAALNDFVMTTFGGMQASGGGWLSSLTSMVGGIFLTTDDTQPVVRGQRQSGTEEGMLLGLFAAAHLNRNFMTTLAHSSAASAPPSPPATLPPRQSPPNLAQIQALNNDQPTNLLVTFFQYCSIVMADTRTETSINKCSLCFITLTCIVEEQFANSIMHDQNLTFKVQLYRLPMRHRKIVPEEPPSQPLASTLIDLLIEFIMCHLLKKFPGDLYSLCVGVLLRLLSYQKRCRVRLSRDWRPLWAALIALLKFLVTNESVLLRKHNIFIMAQQVVNIFNLFITFGDTFLPTPASYDQLYYELIRMYLVFDNLFFMALRYSTGDGEFKAEALRLANCLVNVRAIVQHFSPKIDAWLASQHLSTPTEDQILEVVRKNYDSLILKLQEGLEGYERYNEKTHKALLSKLVRVAASYARARSDAATLTQHSAALLDHYTALA-
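
The transcript and protein lengths listed above are consistent with each other: 802 codons plus one stop codon gives 3 x 803 = 2409 bp. The following is a minimal Python sketch of how that check could be scripted, assuming the standard genetic code and the record's convention of writing the stop codon as "-". The helper names `read_fasta` and `translate` and the record name `toy-TA` are hypothetical, and the toy input is only the first six codons of DPOGS209184-TA followed by its TGA stop, not the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order: TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def read_fasta(text):
    """Parse FASTA text into {record_name: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].strip()
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stops are written as `stop_symbol` to match the record's trailing "-".
    Raises ValueError if the length is not a multiple of 3, and KeyError
    on codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_symbol)


# Toy input (hypothetical record name): the first six codons of the
# transcript plus the stop codon, wrapped over two lines on purpose.
toy_fasta = """>toy-TA
ATGTCTATGCGG
AAAAGGTGA
"""
toy = read_fasta(toy_fasta)
toy_protein = translate(toy["toy-TA"])
```

To apply the same check to the full record, the nucleotide and protein FASTA blocks would be passed through `read_fasta`, and `translate` of the DPOGS209184-TA sequence compared with the DPOGS209184-PA sequence. Because `read_fasta` joins wrapped lines, a sequence that is split over several lines is handled the same way as a single-line one.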