Monarch geneset OGS2.0

DPOGS203939
Transcript: DPOGS203939-TA, 1806 bp
Protein: DPOGS203939-PA, 601 aa
Genomic position: DPSCF300005 - 203472-208236
RNAseq coverage: 426x (Rank: top 29%)
Annotation
Heliconius: HMEL012089, E-value 1e-136, identity 43.76%
Bombyx: BGIBMGA000745-TA, E-value 7e-59, identity 36.23%
Drosophila: CG7927-PA, E-value 1e-42, identity 29.91%
EBI UniRef50: UniRef50_UPI00022465B2, E-value 4e-61, identity 32.59%, UPI00022465B2 related cluster n=1 Tax=unknown RepID=UPI00022465B2
NCBI RefSeq: XP_973489.2, E-value 5e-62, identity 33.33%, PREDICTED: similar to mmr1/hsr1 GTP binding protein [Tribolium castaneum]
NCBI nr blastp: gi|189235276, E-value 1e-60, identity 33.33%, PREDICTED: similar to mmr1/hsr1 GTP binding protein [Tribolium castaneum]
NCBI nr blastx: gi|345482474, E-value 4e-70, identity 32.34%, PREDICTED: transcription factor 25-like [Nasonia vitripennis]
Group
KEGG pathway 
InterPro domain: [199-536] IPR006994, E-value 4.2e-55, Basic helix-loop-helix, Nulp1-type
Orthology group: MCL14325, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203939-TA
ATGTCCCTGCGCAATGTACGTAAGCTTTATGGTGCAACAGCTCTCCCTCCTCCAAATGAAAGTTCTGATGAAGAGTATGAACCCCTTTATGCAAAAAATTCTGCACGTAGTGCTTATGCAGGGTTGTTGTTGAGTTCAAGTTCTGAAGATCATGAATCACCACTGGTTAGTGGAAATAACTCTGGAGCTGAAGATATGAAAAAGAAAAAAAAGAAAAAGAAGAAGAGGCAAGATAAAATAAAACGATCATCACAGAATTCTTTTAATGGTGGCAAGTTGGATGAGATTGATAAATCCTTAATGGAAGTCAATGCACTGCTTGGTGAACCTGAGCTGGAGGCTTCATCGGAGTCAAAATTTAAGCCAGACTTACTACATATTCTTTTCTCGGTCAAATATAAACATCTTAATGTCTCCAATGAACTATTGAAAATGTTTGGGCCCGAGACACCTGAGGAGACACCACGACGCGTTGTTCATGGCCGACAACCTAATTTGAAACGTATCCAGAAGTATTCTATAATTCCTCAGGAATTTAATTTCAAGAAATTGGGTTTGTCAATGTCTGTTGACCGCCGTGATCATGGCATTAGCTATTTCGTTTATGATCACAGCAGGGAATACCAACGTGTTCACAAAGAATTCATGGTGATGCTTACACAGAGGGCTACCCACCTCATGACACCATTTGAGACATCTTTGAAAAACATGCATGTTGAAGGATTGCTGGAGGCTTCCGATGTGATGTTCCGCTTAGAAGATTACTCTGCTGGTAATAAAATCGTGGAACAAGTAATAGCATACATGCAGTTTGTTGCACATCCATCATTCAATGTCACCGATATGCGAGTACGTCTCGAATACAAATATTTGGAAAACAGACCGTTCCACATTGCTTTGCTAAAGTACTTACATCTTCTTACAAATAAGGCTTGCCATCGCACAGCTCTTGAAATAGCAAAGCTGATGCTAAACTTGGACCCTTCCGATCCTTTGGCGGTCATCTTCATTATTGACACTGTTGCTCTCAGGGCTAGGGAACACCAATGGCTTATAGATGCCATAGATTATCTGAACAAGGAAAGGGAGGCTGGATTCATGTTTAATATTCAGTTCTCATACGCCCTTGCGTATTTCCATGTCCTTACCAAGAACAAGCAAAGTATCAAGAAAGCCGATGAGCTACTACAAAAGGCCATGGCAGGGTTCCCGTGGGCCTTAATGCAAATACTGCACAGTGCCAACTACACTCCCGATGAACGTCTACGAGCTCATCCGCTCTTTAACAGCTACGCTTTTTCCACGACATGTAAGAACCTCAAAGACCTTATACTTCTTTATGCCACCTTCACCGGAGCTCGCTGGCGTGAACCGCCAGTTATGGAGTGGCTAATACGGAACGCCAACGAATTGGCTGATAGATATGACGTTGACTCATCCATCAAGGAGCAAACCCAGGGTCTGCAACAGGTGCGTCAAGTTCTATTCCGCGGCTGGCCGGAGCAAGTTTATCGTCACCTCAGTGTCATCAAGACACTCTCCAACCTTCTGGTTGATGGGGCGGTGCCACGTGTAGCGGCTACACGCTGCTACGACCCTGTCCCGCCTCGCGACGGCGTCAACCGTTACGGCTACACTCTCATACCGCACACACACATCAACCTCGGTAACGCTATACTCACCGACTTCTTCACATCCCTGTTACCGAACTACGACATACCGCCCGAGGATGAGTACACATCTGAACCACCGTTTCCGGCCATCCATACTAGATTACCGTCTACCGCAAGAGAACAGAGCAGCTGA

Protein sequence:

>DPOGS203939-PA
MSLRNVRKLYGATALPPPNESSDEEYEPLYAKNSARSAYAGLLLSSSSEDHESPLVSGNNSGAEDMKKKKKKKKKRQDKIKRSSQNSFNGGKLDEIDKSLMEVNALLGEPELEASSESKFKPDLLHILFSVKYKHLNVSNELLKMFGPETPEETPRRVVHGRQPNLKRIQKYSIIPQEFNFKKLGLSMSVDRRDHGISYFVYDHSREYQRVHKEFMVMLTQRATHLMTPFETSLKNMHVEGLLEASDVMFRLEDYSAGNKIVEQVIAYMQFVAHPSFNVTDMRVRLEYKYLENRPFHIALLKYLHLLTNKACHRTALEIAKLMLNLDPSDPLAVIFIIDTVALRAREHQWLIDAIDYLNKEREAGFMFNIQFSYALAYFHVLTKNKQSIKKADELLQKAMAGFPWALMQILHSANYTPDERLRAHPLFNSYAFSTTCKNLKDLILLYATFTGARWREPPVMEWLIRNANELADRYDVDSSIKEQTQGLQQVRQVLFRGWPEQVYRHLSVIKTLSNLLVDGAVPRVAATRCYDPVPPRDGVNRYGYTLIPHTHINLGNAILTDFFTSLLPNYDIPPEDEYTSEPPFPAIHTRLPSTAREQSS-
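
Consistency check (illustrative):

The transcript and protein lengths above agree with each other: 601 codons plus one stop codon is 3 x 602 = 1806 bp. As a minimal sketch, assuming the standard genetic code and the convention used in this record of writing the stop codon as "-", the nucleotide sequence can be translated and compared with the protein sequence. The function name `translate` and its parameters are hypothetical and are not part of the database.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code.
# Assumptions (not from the record): standard code (NCBI table 1), and the
# stop codon is written as "-" as in the protein sequence shown above.

BASES = "TCAG"
# Amino acids in NCBI table 1 order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon, including any stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]  # KeyError on non-ACGT characters
        residues.append(stop_symbol if residue == "*" else residue)
    return "".join(residues)


# First five codons and last three codons of DPOGS203939-TA.
print(translate("ATGTCCCTGCGCAAT"))
print(translate("AGCAGCTGA"))
```

Passing the full DPOGS203939-TA sequence to `translate` and comparing the result with DPOGS203939-PA would be one way to confirm that the two sequences in this record correspond.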