Monarch geneset OGS2.0

DPOGS210691
Transcript: DPOGS210691-TA (2115 bp)
Protein: DPOGS210691-PA (704 aa)
Genomic position: DPSCF300013 - 663789-673401
RNAseq coverage: 367x (Rank: top 32%)
Annotation
Heliconius: HMEL008029; 0.0; 60.27%
Bombyx: BGIBMGA006307-TA; 0.0; 61.98%
Drosophila: zwilch-PA; 6e-09; 30.71%
EBI UniRef50: UniRef50_Q17A23; 4e-15; 20.77%; Putative uncharacterized protein n=4 Tax=Culicinae RepID=Q17A23_AEDAE
NCBI RefSeq: XP_001650883.1; 8e-16; 20.77%; hypothetical protein AaeL_AAEL005433 [Aedes aegypti]
NCBI nr blastp: gi|157109929; 2e-14; 20.77%; hypothetical protein AaeL_AAEL005433 [Aedes aegypti]
NCBI nr blastx: gi|157109929; 4e-18; 20.95%; hypothetical protein AaeL_AAEL005433 [Aedes aegypti]
Group
KEGG pathway:
InterPro domain: [28-630] IPR018630; 9.5e-25; RZZ complex, subunit zwilch
Orthology group: MCL25225; Lepidoptera specific

Nucleotide sequence:

>DPOGS210691-TA
ATGGAAAAACTTATACAGCCAGTTAAAGAAGTTTATTCAAAATTTGTATTTGACAAACAAACTCCTAGTTATGTAGAGATATATACGAAAAACGAAACAGAAGCCGTTATTGTGTATTCAAAGGCCAAGCTGACGCCAGCTTCGACTTTACAAGTTACTAACGAAAACAATAAATCAGAAGAATTAGACCTGACGGGATCACCTCTGAAACTGGATTTAAGTTTGCATACAATACTCGATGAAACATTTGTTAATGATGAACCACAGCTGTGGAGGAAAGAAGAGGCTCAGCATGCACCTATCGATATAGATACAGCAAGAGAAATATTAAACTCTTATAACCAAATCACAAACAAGGTGACAGCTGAAAACGCAATTCCAATGTGGGTACTCTGCAAACCGTCAGAGACAACAAGAACATTACTCATGACCATTCAATCTAATGAGAACCAGTTTGATAGGGGCTTAGTTACTTATGAGGGTTCTATGTCACTGGATGAAATCGATGTGGATGAAATGGTCTCCAAGTTTTCGGAACTAGACAGAAGAGATGAATCTGATGTATCGATATCGGTGGAATGTAAGTACGCTATTTCTGGTGTATCATACAGCTCGTATACTAGCGAAGAACAGCTGATTGCTCCCCATGGCGGACTGACGGAATTAACATGTCAATGGTCAAATAAAACGCTGCTCACACCCTTTATAAGCTGTAGTGTACATTTCGTACAGGAGGTGATAATCGGACATATAGCGAGTCCATGCAACGCTATATGGAAATCTGTGTGCGCTTTGCACAATATAAATCAATTGTTGGTAGAAATGACCTCAGCTGGTGTAACTAATATTCCACTAGATAAGGCTTTTATAAGAATCAACAATCCCAATATAAAGCAGCCCAATAATTCAAAACGTCTCACAGAGCTTCTCAATGAGACCGAAATGTACACGTATACAGCCGAGTGTCCTGTAGGTGGGTGCATCTGCGTGACTGAAGACACGACCAGCTTGCGGCAGTGTATGTCCGTCTTGTCCTCCCAGGGGTCCAGTAATGATTTTACATATAAACTATGGGATATTTTAAGAGACTGTGAAACCGCTGAGGAACTAGTTACATTGCTAATACAGGCTCTTAAGTTCATATCGTCTGGGAAGATAAGGCCATTTATAGACGTTAACAATAAGAGCTATCTATCAAAGCTGGTGTTGAAACTGTCGAGAGGACATTCCCAAACATCTAAAGTATTGAAGAACCTTAAATCAAGTCCGCCTCAAGCTCTGTCACTCGTCGGACAAGTTGGTATAGAGAAGACTATGTGGGAGTATACAAGAATAATGTCACTTTTAGAACATTCGTTCTTTATAGCTGGCATCTGGCACTCAGACACCAGATCAAATGAATCGATTGAACAAATAAATCAAACGATCCAAGACATGACAATGGGTGATACGTTGAATCCGTTTGAGAATCTCACATCAACGGAACACTCCATACGTCTGGACACCGAGTCTGTGTGCATTGATGATGACAACGAGCTCACTGTTGATGACTTTACATCGCTGAAGAAGCATGGCTTGGTTAATGAGAGGAAAGACATCAATGAGGTGCCGTTAATAGCCGATGAGATAGATATAAGCCCGTGGAAGAATCTATTGATGAAGTTTGCTCAAGTGCATGTCTGTTTGGAACATTTGTACAGAGCGGAGACATGTCTGAGGGCAGATTTTGTCCAACTGAAACCGATGGCATCAAAGTTATTAGAGAGCTATGTATCTGAGAAGTCGTCCATTAAAACCGTCGGCCAGCTTATGAATGAACCCATACATAATATATTAATACCAATAACAAATAACATAGTACAGGATCAACTCAAGAAACCAGCTGCTTGGTACAGATTGGAACTTGGATTCAAAGAAGTCTCTGATGTTAGAAACTACAAATTAGTCAGCGTTTTTTCACAACTACCTGCATTCCCACCACAAGTATGGCAACACATAGA
ACCTCCCTCAGATGAAGTGGTTGAAACGACTACTGTGGAAGAATTGAAGTATCATCATACAAAATATATGTTTATAAGTGATAAGTATTCGAGGAAACTGGATTTCTTAATGTAA

Protein sequence:

>DPOGS210691-PA
MEKLIQPVKEVYSKFVFDKQTPSYVEIYTKNETEAVIVYSKAKLTPASTLQVTNENNKSEELDLTGSPLKLDLSLHTILDETFVNDEPQLWRKEEAQHAPIDIDTAREILNSYNQITNKVTAENAIPMWVLCKPSETTRTLLMTIQSNENQFDRGLVTYEGSMSLDEIDVDEMVSKFSELDRRDESDVSISVECKYAISGVSYSSYTSEEQLIAPHGGLTELTCQWSNKTLLTPFISCSVHFVQEVIIGHIASPCNAIWKSVCALHNINQLLVEMTSAGVTNIPLDKAFIRINNPNIKQPNNSKRLTELLNETEMYTYTAECPVGGCICVTEDTTSLRQCMSVLSSQGSSNDFTYKLWDILRDCETAEELVTLLIQALKFISSGKIRPFIDVNNKSYLSKLVLKLSRGHSQTSKVLKNLKSSPPQALSLVGQVGIEKTMWEYTRIMSLLEHSFFIAGIWHSDTRSNESIEQINQTIQDMTMGDTLNPFENLTSTEHSIRLDTESVCIDDDNELTVDDFTSLKKHGLVNERKDINEVPLIADEIDISPWKNLLMKFAQVHVCLEHLYRAETCLRADFVQLKPMASKLLESYVSEKSSIKTVGQLMNEPIHNILIPITNNIVQDQLKKPAAWYRLELGFKEVSDVRNYKLVSVFSQLPAFPPQVWQHIEPPSDEVVETTTVEELKYHHTKYMFISDKYSRKLDFLM-