Monarch geneset OGS2.0

DPOGS207158
TranscriptDPOGS207158-TA1236 bp
ProteinDPOGS207158-PA411 aa
Genomic positionDPSCF300001 + 4427634-4482011
RNAseq coverage445x (Rank: top 28%)
Annotation
HeliconiusHMEL0130292e-12061.35% 
BombyxBGIBMGA004945-TA1e-12363.19% 
DrosophilaMESK2-PI8e-12361.49% 
EBI UniRef50UniRef50_Q9GU501e-12061.49%GH09802p n=48 Tax=Pancrustacea RepID=Q9GU50_DROME
NCBI RefSeqXP_001600170.18e-13562.02%PREDICTED: similar to Misexpression suppressor of KSR [Nasonia vitripennis]
NCBI nr blastpgi|2700046187e-14364.94%hypothetical protein TcasGA2_TC003984 [Tribolium castaneum]
NCBI nr blastxgi|2700046184e-13564.94%hypothetical protein TcasGA2_TC003984 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[67-378] IPR0041421.7e-131Ndr
Orthology groupMCL11638 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207158-TA
ATGGATAACTATGCCTTTGTCAGTGATGAATTGGTGCTGGAATCAACTCTGTCCGGAAAACAGGGTCTCCCTACCGCAGAGCCAATGCCGTCAGTTATTTGCCTTGAAACAAAGAGGGGCAGGATCACGGAGCCAAGAAACGTGCTAATAAAGATGCCTGCCAGCTCTGAGGATACCGCTCTGTTGAGCTTGCTGGCGATCGACAGCATGGACGATATAGAGTTGAAGAGCATCCAACTACAATTCCCGCGACGGTTCAGTGACGGAGCCTGTCAGGAGGTGCGAGTGCACACCGACCGTGGGGATATCATGGTGGCGGTGCGCGGGGAACGCACCAAGCCCGCTATACTGACTTACCACGACATGGGGCTCAATTATACCAGCTTCCAACCATTCTTCAACTACGTGGACATGAGAGCGTTGTTGGAGAATTTCTGCGTTCTGCACGTAAATGCCCCAGGCCAGGAAGAGGGGGCGCCCACGCTGCCTGACGATTATGTATATCCAACTATGGACGAGCTCGCCAACCAGATCAACTACGTGCTCGTGCACTTTGGCATTAAGAGCTTCATAGGCTTCGGCGTGGGAGTCGGTGCCAACATCTTGGCGCGTTTCGCTCTTACCAACCCTGATAAGGTGGATGCTCTAACTCTAATCAACTGTTCGTCGAGCCAGGCAGGCTGGATTGAATGGGCGTCTCACAAGATGAATTGTCGCGCGCTTCGCAGCCGCGGTATGACACCCGCCGTGGTCGATTACCTCATGTGGTACCACTTCGGAAGGTGTCCTGAGGAGCGCAATGCTGATCTCTCAGCGATGTACCGTTCCTACTTCCGTCGTCATGTGAACGCCGGGAACCTCGCCATGTTGGTGGACAGCTTCGCTCGTCGCACCGACCTGAACATCACCCGCCACGCCGGCACCCTCCGCCCGCCAGTCCTTAACCTGGCCGGAGCACTCTCCCCGCACCTCGAGCAAACGGTCACACTCAACAGCCGTCTACACCCATCCAACTCCACCTGGATGAAGATCTCTGATTCGGCCATGGTGCTAGAGGAGCAGCCCGGAAAAATCTCTGAAGCATTCCGTCTATTTTTGCAAGGGGAAGGATATGTGGCGCCGTTGTCTCCAACCAAGATCGTGTTGTTCCGCCGGCTGTCTGACGCTCGCGTGTGCCGTCACTCGTCCGTCATCCGCATCACTGAGAACCCTATCTCGGAGGCTGTGGTCTGTTAG

Protein sequence:

>DPOGS207158-PA
MDNYAFVSDELVLESTLSGKQGLPTAEPMPSVICLETKRGRITEPRNVLIKMPASSEDTALLSLLAIDSMDDIELKSIQLQFPRRFSDGACQEVRVHTDRGDIMVAVRGERTKPAILTYHDMGLNYTSFQPFFNYVDMRALLENFCVLHVNAPGQEEGAPTLPDDYVYPTMDELANQINYVLVHFGIKSFIGFGVGVGANILARFALTNPDKVDALTLINCSSSQAGWIEWASHKMNCRALRSRGMTPAVVDYLMWYHFGRCPEERNADLSAMYRSYFRRHVNAGNLAMLVDSFARRTDLNITRHAGTLRPPVLNLAGALSPHLEQTVTLNSRLHPSNSTWMKISDSAMVLEEQPGKISEAFRLFLQGEGYVAPLSPTKIVLFRRLSDARVCRHSSVIRITENPISEAVVC-