Monarch geneset OGS2.0

DPOGS202363
Transcript: DPOGS202363-TA (1899 bp)
Protein: DPOGS202363-PA (632 aa)
Genomic position: DPSCF300104, 158821-168868
RNAseq coverage: 1034x (Rank: top 12%)
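The lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that the transcript is coding sequence ending in a single stop codon, and that the scaffold coordinates are 1-based and inclusive.

```python
# Values copied from the record above.
transcript_bp = 1899
protein_aa = 632
start, end = 158821, 168868

# Assumption: the transcript is coding sequence plus one stop codon.
expected_bp = (protein_aa + 1) * 3

# Assumption: scaffold coordinates are 1-based and inclusive.
genomic_span = end - start + 1

print(expected_bp == transcript_bp)   # True
print(genomic_span)                   # 10048
print(genomic_span - transcript_bp)   # 8149 (span not covered by the transcript)
```

Under those assumptions the transcript and protein lengths agree, and the gene spans about 10 kb of scaffold, most of which is not part of the 1899 bp transcript.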
Annotation
Heliconius | HMEL002889 | 7e-149 | 70.98%
Bombyx | BGIBMGA013997-TA | 4e-172 | 76.73%
Drosophila | simj-PB | 1e-66 | 41.65%
EBI UniRef50 | UniRef50_E3X649 | 9e-75 | 41.37% | Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3X649_ANODA
NCBI RefSeq | XP_968148.1 | 5e-77 | 39.06% | PREDICTED: similar to simjang CG32067-PC [Tribolium castaneum]
NCBI nr blastp | gi|91086667 | 1e-75 | 39.06% | PREDICTED: similar to simjang CG32067-PC [Tribolium castaneum]
NCBI nr blastx | gi|91086667 | 1e-95 | 39.71% | PREDICTED: similar to simjang CG32067-PC [Tribolium castaneum]
Group
KEGG pathway: (none listed)
Orthology group: MCL10974 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202363-TA
ATGGACGTTGACGACTCCGCCGTCGATCTTAGCGTGAGCAGGTCTTTACCGCCTGAACTCAGTGAACTAAGAGCTCTCACTGCCAGCGGACTAACCATAACACCTGCACAACCACCGCCTCATGGTTCGGCTGGTAAGAGGGTACTTCGCCCCCGCGCCGATCAGCGCAGTTACGCGGAGAGCCCTGATATAGTGCTGCTGCCCGCAGACCCTCCGCACAGGAAGCCGGTGTTGCCAGCACCTTCACCGTTTACCGATGTCAGCAGTAAGTTGAACACAAACGTCAACCTGTCTATAACGGCCAGCGAGTGTGCCAGGGACCCCATGGACGACGAAGAGTCTGACGGAGAGAACGAACCACCCCTCCCAACACCCGCGCACAGGGAGCTATCCAGCGCTGAGATATGGGAACGAGAGCGACGTCTGAGATCACTGCGAGAGAAGCTGCGGGCGGAGGAGACGCGCCTCGTACTCCTAAGGAAGCTGAGACAATCACAACAAGCCACTGTCACGACTACTAAGGAGTCTGTGTGCCCGTCCCCCGGCACGGCCCTGGCCGGAAGCGGGTGTGTGGTGCCCCCCGGGGTCACGGTCACCCCCGCCCCGCCGCCCGCGCACCAACACGCCAAGCGCGGTAGTGTGTCGGGCGCCGCTGGCAATCTGTCCACGGCTAGGCGCACGGCCAGTCTGCCAGGGGGCGCCACTCTCACGCCGGGCCCGTACAGGACTCAGTCATCAGGGGGCGCTAGCATCACTCCATCGGTGACGATCACACCAGCGCCGCCTCCGACACACGCCCACAATAACAACAACACTACCAACAACAATAAGGCCAGTTCTCGCAGCTCCGAGGACCCGCAGACGCCGGCCCAGAGGCAGGCGGCCGCCAAACTGGCGCTCAGGAAACAGCTGGAGAAGACACTGTTACAGATCCCGCCTCCCAAGCCCCCCCCGCCGGAGATGAACTTCATCCCCTCCCCCAGCAACACGGACTTCGTGTACCTCGTGGGGCTGGAACACGTGGTGGACTACCTCACTAACGAGGACCGGATGCCGCGGTCGTCGGTGCCGGCGGTGTGCGCGCAGTGCGGCTGCGACTTCACGGCCGTGTGGCGCTGGGAGCGCGCGCCCGCCCGCAGGCAGGACGCCACCTTCCCGACGCCGCACACGCACACGCCGCGCAGGCTGTGCGAGCTCTGCGTCTCCGGGAACGTCAAGAGGGCGCTCAAGGCGGAGCACACGGCCAGGCTGAAGACGGCCTTCGTCCGAGCGCTGCAGCAGGAGCAGGAGATAGAGAGACGACTGGCGGCGCCCAGCCCGCCGCCCCCCGCCAGCGCGGCCCCGCCGCCCGCCCACACGCACCATCACAGACCGCAGACGCTAGAGGTCATCATGCGGAGCGGGCCAGGGACAACGAGGGGAACAATAAGAAAGGTAACACACGGACATACACACATGGACATGTACATATATAATATACTATGGACAGAGTTATATGCAGATGCTATTACAGGGTCTTCATCAAGCACCAGCAGTCAAGGGAGCAGCAAACAACATCAACTGGCAGCAGCTGCGGCCGCTCAGATGGCCTTCGAGCAGCAGAGCGCGGCCGCCATGCAGGCGTTGCAGCACCAGCTGCTGAGAGGTCTGAGCGGCGCGGGCGGGACGGGCGGCGTGTCGCAGGCGGCGGCCGCGGCGGCCATGATGCAGTTCTCTCCCCTGCTCTACACATACCAGCTGGCCATGGCCCAGGCCAGCGCTCTCGGCAAGAGATCCGGCAAAGGTTCGAGTAACGCCGCCATGGCCGCGGAGATGCAGCGCGTAGCCGAAGCTCAGAGACAGTACCTACTGGACATGATCCCGGGACAACACGCGCGCAACCCCTGGACCAAGAACTAG

Protein sequence:

>DPOGS202363-PA
MDVDDSAVDLSVSRSLPPELSELRALTASGLTITPAQPPPHGSAGKRVLRPRADQRSYAESPDIVLLPADPPHRKPVLPAPSPFTDVSSKLNTNVNLSITASECARDPMDDEESDGENEPPLPTPAHRELSSAEIWERERRLRSLREKLRAEETRLVLLRKLRQSQQATVTTTKESVCPSPGTALAGSGCVVPPGVTVTPAPPPAHQHAKRGSVSGAAGNLSTARRTASLPGGATLTPGPYRTQSSGGASITPSVTITPAPPPTHAHNNNNTTNNNKASSRSSEDPQTPAQRQAAAKLALRKQLEKTLLQIPPPKPPPPEMNFIPSPSNTDFVYLVGLEHVVDYLTNEDRMPRSSVPAVCAQCGCDFTAVWRWERAPARRQDATFPTPHTHTPRRLCELCVSGNVKRALKAEHTARLKTAFVRALQQEQEIERRLAAPSPPPPASAAPPPAHTHHHRPQTLEVIMRSGPGTTRGTIRKVTHGHTHMDMYIYNILWTELYADAITGSSSSTSSQGSSKQHQLAAAAAAQMAFEQQSAAAMQALQHQLLRGLSGAGGTGGVSQAAAAAAMMQFSPLLYTYQLAMAQASALGKRSGKGSSNAAMAAEMQRVAEAQRQYLLDMIPGQHARNPWTKN-
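
The protein record appears to be a direct translation of the transcript, with the trailing "-" standing where the TAG stop codon falls. A minimal sketch of that check follows, assuming the standard genetic code; the `translate` helper is hypothetical and written for illustration, and only the first 18 and last 6 codons of DPOGS202363-TA are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# First 54 nt and last 18 nt of DPOGS202363-TA, copied from the record.
head = "ATGGACGTTGACGACTCCGCCGTCGATCTTAGCGTGAGCAGGTCTTTACCGCCT"
tail = "CCCTGGACCAAGAACTAG"

print(translate(head))  # MDVDDSAVDLSVSRSLPP
print(translate(tail))  # PWTKN*
```

Both fragments match the corresponding ends of DPOGS202363-PA, with "*" here corresponding to the "-" printed at the end of the protein record.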