Monarch geneset OGS2.0

DPOGS207536
TranscriptDPOGS207536-TA3699 bp
ProteinDPOGS207536-PA1232 aa
Genomic positionDPSCF300177 + 665495-674316
RNAseq coverage47x (Rank: top 71%)
Annotation
HeliconiusHMEL0217890.064.07% 
BombyxBGIBMGA001900-TA0.057.69% 
Drosophilasha-PB2e-10239.08% 
EBI UniRef50UniRef50_UPI0000D56B092e-12144.68%UPI0000D56B09 related cluster n=1 Tax=unknown RepID=UPI0000D56B09
NCBI RefSeqXP_394811.35e-12732.20%PREDICTED: similar to shavenoid CG13209-PA [Apis mellifera]
NCBI nr blastpgi|3838490011e-12734.59%PREDICTED: uncharacterized protein LOC100880665 [Megachile rotundata]
NCBI nr blastxgi|910877873e-17736.91%PREDICTED: similar to shavenoid CG13209-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL18982 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207536-TA
ATGGCCCTGCTGCTGTTGCTGGTGGCAGTCGCGGGCGCAACGAGCCTCGGAGACCTGACGAGAAAGGATTCCGGAGACGTCTTCACTATACTTGATGGTGAATGTGGTGCTGCAAGATGCGCGGAGCACGGAGCTGGAAAAGGCGTAGAAGACCTCGAGTGCTCCTGCGCGTGTCCCCAACGAGCTCCCCTGTTCAGAGAGGACAGGGAATTGTGTGTTGACGACCTACCAGAATGCTCCCTCGCAACTTTCGGCACTGGTCTTGGCACGCAGAGGATACCCTTCGTTTATCTCCCATTGAAGGGTCAAATAATCCATCCATCCAGAGAAATTACTTTTCAAAATGTCAAGACTCCAATATGCGCAGTTTCGGGAGCACAGTTTTTAACACGAAAGGGATTCCTCGATCTTAGGAATACACTTGACGCTGATGTTCCGTTTAATTTATTCCGTGATGAAGGCAGAACATTTCTTCAGTGGAGTGGGGAGGACGAAGTGCGCAAGCGCATGTCAGGTCGTATGATGGTGGTCAGGCTGCTGTGTCGTGATATATCCGCCGCAGCGTCTTCGCCGCTCGACCTTCGTGGAGTATTCACGCCATGCGTTGCCTTCAGAGTCCAGGGAACACCGCCTCGACATTCCAATAATATAACCGAAGTTCAGTTTGCACCAAACGTACAGACATCAGAAGGCTCAACAGCGACTGGTTTAACTGTATCAGAATATATAGCCATTGGTATCAGTTCCTTGCTTTTAGGTTTAATTTATGTGGCATCTGTATTTTTATATCTACACATTAAAAAGAAAAGGAATTCAACGGCAAAAGAGAACGGCTTAAGAAAACTTAAAGGACTGAAAAAGGATGGCTTGACTATAACGGAACGTGATATAATAAGGATAAACAATGAACGTATACAATCCCTGCCAAATGTTATGGGTCAAGATGATGGTGTTGTCAAAAAAAATCCACTATTGAGTGTAGGACGGCAGTTTGATAACAAAACGTTTCCTAGTGATCTCTCTGATTCAGATGATTTTGCGGACACTTTGCGCAAGGATGATAATTCCTGCCATAATCAACTAACATCAGTAGTAATACACAGACATATAGACTTGAAATGTGATAATGTAGATCATCACAGAGAAGAGAGCATTGAAAGATTACCAGACGAACATGTCAGCATTGTAGAAACTATAGATGATAGAGAAATAACGCGACCGGTTGGAACCACACGCCGGAAACTTTACTTCAACCCAGCATACTTTGAACCACAATTAATGGCTGATCCCCCTCCTGCAGCTATTGAGTTTCTCTCAAAAATTCGAGAAGTAATATCAATCGCTAAGCAAAAGATGGCAGCAAAACGTTTTCATCCAGTGCTAAACGAAATACCAGAAGAAGAAACATATCCTTCAAACGCAAACAGCATAGATATGTACCACGGCGTGGGGAGCCAACGTAGCGGTAGCCTTGTTAGTTTGAAAAGAGAAAATAGCAGGAAAAGATCAACAAATTGTATGGGTTGCCCAGGATGTAAAAGCAATGTAAGCAATGATATTCAAAATCTTGTTAAACAAAACGTTACAAAATCATGCACTAATTGCTTCAATGAGAAAGGAGAGAAACAAAACAGTATTCGGAAGTGGCTTGAAAACATCCCAAATGTTAAAAACTCACTTTTCTTCAACGATAGCGACGCTCCAAACAATTTAACGCATTCACTGCATGCTTTACCTAGCGAGGGAAGCCAATGTAAACTACAAAAACAAAATAGCTTCTCCCATGTGACGCACTCAACAGACAACCTCACGTCACACAGATCCACATCCCGGTCTGTGAGATCTGAGCCATCATTAAGAAACTATAATATACCTTTACCAGAATTTAATAGCGAAACCACTGAAAATAATAACTACCTAACTATGTCTCGAATAAATGAATTAAAAAATATAGAATTAAACGAAAGAGCGTTTGACGTACAAAATGAAAGAGAAGTCTCGAGACAAAACATGCAGACATTGAAAAATAAAAGTGGTTTACCCGACATGGTCAACGAGGCTATAGCGCTTGACCATTTTTCAAAATCTCTATATAATACCAGCAGTTCAGATGAGGAGAGATGTTCTAGAAATGCGCCAGAAAAAAGCAATTCAGATAGTCCTTCTGGAAACGAATATGAAACTGATAGTCTTGAGAGGTCGTCTCATAAGAGGAACAAAACTACGACCCTTGATTATCTTGAAGTACCGTCATCCCAAGCTTCTCCGAGTTTAAGTACTGCTCTGCCGTTAGAAGAAGAACTAACTATGAGAAACGCTGTTTACAAGACGCCCTCTAGTGGTAACAGTAATACTCCGTCGCCCGAAGCACATATTGGCATAGAAGAGAATCACTATGAGACTATAGACGTTAAGAAAACTGACAATATCCAAGAAACGATAGACATTACGGTTAAGCCTAGTAATAGTTACAGTTTAGTAAGCGAAGTATACGTTAATAATAATTACAATTTTGGTAGTGCGCCTACTTCACCTAGTGGTTCGGAGTCTTCGATGGGTAACAGAAAATTGATTCAATTTAACAATTCTGTAGCAAAACCTGGATGTTTAACCATAGAATTAAAAGATCCCCCTGAAAATTATATCAAAATTCACGAGTCGGATGGCTTTGAACCAGACACTTTGGACCGTAAGCATCTTAAACATAAAGAAAGTGTAGAGAGTATTCAATTAGATCGACAAGACTTCCTAACCGACTGCGACAATTCCGTAAAAAGAGACAAAAAAATTAAACTAGGAAGCAGCGAAACTTTCTGTAAAAATAACGGACAAAAAGAGAATAGTAACAAATTTAACAGTTTAAGAAATGACCACGAGCATGGCTTTGATCGAACTAAGTTGTCGCCTATTTTGTACAGTGGTTCAAAGTCTCTTGACACTGCAACTGATGACACATGGGATGATAACGCAGGTTGGAGTTCTGAGGAAGGCAGAATATTAACATTAGAGCTTAGACACTCGAAGCGACAACGACAATCCACGCCACCGTCTATAAAGCAAATGAAAAATTTGGCTCGACCTGATATTTTGCCTCCCCTACCGCCAACTGAGGACACCCCTATATACGAAAAGCCGACAATCCCGCCAAAGAGGGTTCCATATGGAAGCCCAGTACCACAAACCATCACTGAAAAACGACAAATATTTCCGCGTAATTCAATATCTTGTAGCTCATTAAAAATTGCAGAAACCGATGATATGAATAGTATAAAATTGTGCGAAAATGATCAAAGATCAGAAAGTGGTCGTAACTGTAGGAGAGCATCAAGCAGTTGTAGCAGCGTTGTAAATACAAACACATTCATTAAAGGACAAAAAGCCGAGAGTATTAGAACAAAATTACGTCGCAGAAAAGGTTCCAACATAGAAGATTCCGGATATCTCAGCAGTGATTCCACGTGTTCAAAACAGTTTCAAAGGAAAATAGTAATAGCGAAAATTGACAGTTGTAGTGACAGTGACGAAACAGAAGACGAAGCTAGAAGTGAATCAGGTGCAGAAAGTGTTGAAACACATTCCGTATATTTTGGTAATTGTCCTAGATTACGTAAAAATGAGGAGAGCATCAAAAACACAATAAAAACCACAAGGGACAGCAAACGCAAAGTTATAGTTAATAATGATGTAAATAATGAATAA

Protein sequence:

>DPOGS207536-PA
MALLLLLVAVAGATSLGDLTRKDSGDVFTILDGECGAARCAEHGAGKGVEDLECSCACPQRAPLFREDRELCVDDLPECSLATFGTGLGTQRIPFVYLPLKGQIIHPSREITFQNVKTPICAVSGAQFLTRKGFLDLRNTLDADVPFNLFRDEGRTFLQWSGEDEVRKRMSGRMMVVRLLCRDISAAASSPLDLRGVFTPCVAFRVQGTPPRHSNNITEVQFAPNVQTSEGSTATGLTVSEYIAIGISSLLLGLIYVASVFLYLHIKKKRNSTAKENGLRKLKGLKKDGLTITERDIIRINNERIQSLPNVMGQDDGVVKKNPLLSVGRQFDNKTFPSDLSDSDDFADTLRKDDNSCHNQLTSVVIHRHIDLKCDNVDHHREESIERLPDEHVSIVETIDDREITRPVGTTRRKLYFNPAYFEPQLMADPPPAAIEFLSKIREVISIAKQKMAAKRFHPVLNEIPEEETYPSNANSIDMYHGVGSQRSGSLVSLKRENSRKRSTNCMGCPGCKSNVSNDIQNLVKQNVTKSCTNCFNEKGEKQNSIRKWLENIPNVKNSLFFNDSDAPNNLTHSLHALPSEGSQCKLQKQNSFSHVTHSTDNLTSHRSTSRSVRSEPSLRNYNIPLPEFNSETTENNNYLTMSRINELKNIELNERAFDVQNEREVSRQNMQTLKNKSGLPDMVNEAIALDHFSKSLYNTSSSDEERCSRNAPEKSNSDSPSGNEYETDSLERSSHKRNKTTTLDYLEVPSSQASPSLSTALPLEEELTMRNAVYKTPSSGNSNTPSPEAHIGIEENHYETIDVKKTDNIQETIDITVKPSNSYSLVSEVYVNNNYNFGSAPTSPSGSESSMGNRKLIQFNNSVAKPGCLTIELKDPPENYIKIHESDGFEPDTLDRKHLKHKESVESIQLDRQDFLTDCDNSVKRDKKIKLGSSETFCKNNGQKENSNKFNSLRNDHEHGFDRTKLSPILYSGSKSLDTATDDTWDDNAGWSSEEGRILTLELRHSKRQRQSTPPSIKQMKNLARPDILPPLPPTEDTPIYEKPTIPPKRVPYGSPVPQTITEKRQIFPRNSISCSSLKIAETDDMNSIKLCENDQRSESGRNCRRASSSCSSVVNTNTFIKGQKAESIRTKLRRRKGSNIEDSGYLSSDSTCSKQFQRKIVIAKIDSCSDSDETEDEARSESGAESVETHSVYFGNCPRLRKNEESIKNTIKTTRDSKRKVIVNNDVNNE-