Monarch geneset OGS2.0

DPOGS210264
Transcript: DPOGS210264-TA, 3054 bp
Protein: DPOGS210264-PA, 1017 aa
Genomic position: DPSCF300216 - 110259-118071
RNAseq coverage: 18x (Rank: top 80%)
Annotation
Heliconius: HMEL008621, 2e-45, 45.40%
Bombyx: BGIBMGA000027-TA, 2e-101, 32.83%
Drosophila: scpr-B-PA, 2e-13, 26.91%
EBI UniRef50: UniRef50_Q5ZNT5, 3e-119, 84.84%, Putative uncharacterized protein n=1 Tax=Cotesia congregata bracovirus RepID=Q5ZNT5_9VIRU
NCBI RefSeq: XP_001865174.1, 1e-18, 27.30%, venom allergen 5 [Culex quinquefasciatus]
NCBI nr blastp: gi|57659251, 9e-119, 84.84%, hypothetical protein CcBV_9.1 [Cotesia congregata bracovirus]
NCBI nr blastx: gi|57659251, 2e-113, 84.84%, hypothetical protein CcBV_9.1 [Cotesia congregata bracovirus]
Group
KEGG pathway 
InterPro domain:
[267-510] IPR014044, 2e-31, CAP domain
[281-539] IPR001283, 2.7e-30, Allergen V5/Tpx-1-related
[161-238] IPR018379, 7.5e-09, BEN domain
Orthology group: MCL26416, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210264-TA
ATGTTGGACAATTTTGGTGCCCTCTTCACTCAAACGGGGTCTCATTTGCGTTATACGACTGATATGTACAACACCTTACGGAGTTCATACCTCGATACTGCTGAGACATACAAAAAGTTATTGGGTGACTTCGAACAATTTAATAGATCAAGAAACGCAGCTAGCAAGGCATCTCCCTTGATAAATTTGAGACCAGAAGGTCCACGGTCTCCTGAAGAAAGACACGCTAAGACAACAGCGGGTGCTAGTTCTAGTAATCAAGATCAACATAGTAATGATGTTATCAACAAAACGAAGGAAAAAATACACAACAAGTTGTACCGGTTCGTTCTACCTCCGGAATATGATGCTTATGATACAAGATGGACATTGAAGTATCGAACAAAACTGCCCGGACACGTAGAACTTATGCCCCAGAGTGGTGTTTACGTCAGCTGCGGAGACCTCAATCACTGTCAGCAAGTATCGAAAGATTGCATATTATTAGCTCGACGATTATTATCACAAGTCTTCAACAGAAATGCACCGAGTGTTTGTTCATCAATGAGTGAGAAAGTACTGGCTTTTAATAATCTCGGCTCTAACATAAGGCCAGATTTAGATGATCATGCGTGTTCGGTACTGTTAAATTTTGTTTTAGAACATGGACTCCAACGTGGTTGGAATACTGATCTACAACCTATCCTCAGTACCTTACACAGCAAGATGCAAGAGATTCGGTTCAAATATGGTGCTCACTTTCTCACAATATTAAGTCTTGCTTTGGGCGCCGATGTCTACGATCAATTCAATCAAGAAGCGGTCATCGTGTCTGAATACTGTCCCAGCATCCCAGGGTGTCAAGATGGAACCCACGTGATGTGTCTACATTACCATCCCGAGAGCGAAATGGGCCCGGGTTGTGTACATCCAGTCAATGTAACTATGACTCCAGACTTAGCTCAACTGATCGTCGATATTTCTAATAGAATCAGAAGTAAAATTGCAAATGGTTTGGGGAGAGGTGAAGAAGCGGTGGATCTTCCAAAGGCTTATGGAATGTATAGGCTGAGTTGGGACACGGAGTTGGCTACATTGGCCCAGGTTTGGGCGAATCAGTGCATATTCAGTCTCGATCTTTGCCGTTCAACCAGAAGATTTCCAACTGTCGGTCAAGTTGTTGGCATGAGTCGGTACACCTTAGACACTTGGATGCCAATATCTGAAGATGCAGCGCAGATCTTCAATACCTCAACCCTCACTCCTGACAAAGTCAGATACGCTATCACATATATCTTGAAAGCCTGGTATGAGACGAAGAATGACGTGTTACCGGATGTTATAGAGTATTTCGATATTAAGAGTTACTCAAAGCATAATCAGTTTGTACAACTCGTGAGTGGTCCCAGCACACACGTCGGGTGCGGAATATCCGCATACAGGGAATACTCCTACAATAATAACAACGCTGCTCTGAACTATAATTCAATACAACTAGTATGCAACTTTTCGCACCAGCCTCGAGTTGATGAGTCCGTGTATCAAACAAAGCCGCCAACGGAGCCCGGGTACACTGTGAAATGCGGATGTCCGCTGGGATATGATGAGGATGAAGATTGTCTTTGCTATGAAACCGAGAGGGAATTGCCATACAGCTGTAAAAGCACTGGTAGGTGCAAACCCTCTGTTGTTGTGTTACCTATTTTCACCGTAGAAGATGCACCCATTCACAGTATAGGGCAAAATAAAAGTTCACAAATCACACCAGCTGTAGGTCCTGACGTCGTGTTAAAGAGACGAATAAATAGTAAAGACTACAGTAGATCGCAAAAAAATCATAAACAAGCAATAAAACAGGACAGCACAATAAAATCAAGAAGAATTAATGTTGATGTAAAGAAAAGATACTTTACAGAAAGAAAAGGAAAAAAACAATCGAGACTGTTAAACGAACATTATAGAAGACACACACCCAGACAAAGCATTTTTGCTAAAGCCGCAGTTTTTGAACTACCCTCCAG
ATTGAAACAATACGGTCACAAGGTGAATAACTATAAAAAAGATGTTGCACCTAGAAGAGACTTTAGAGAAGTTCATACGATTGTATCAAAATATTTAGAAGGTCGTAAAACGGGGATTCCGTTACGACATAGTATAGAAAGAAAAAACTTAACAATTTCTAGATTTACTGAAATACCTTCGTATACTGTCATAATAACAAGCACACCGCATACTGTAAATCGTTTTAAAAAAGATAAATTTAATTATGGAAATTTAAATGCCACGAAAATGCCAAGAAACGGAGACACGAGTATTAAGCCATTGAATGAAGATACTGACCAAATGTTGATGGATCTATTAGATAAATTGGAGGAAGAAGTGAATAATGTACACTTTAATACGAAAGAAAAAGAATTTTTCGATGAAAAGATACGCAAAATATACGGAACCTTAATCAAGAAACCAGTTGTAGTGGAACCGAAAATTGTTGTACCCAATACAAACGTGTCCCTTGATTATAATGTCGATAATAATCTGGTAGAGAATGATATGAATGGACATAATTTAGGGTCAGATAGTAACAAAGCCAATGCGTATAGCTATCCCCAGTTAGAAGATAGACCGATAAACGAAAAATCTTATAAAAAACAAAAACAAAACATAAATGAGGATAAAAAAGATACAGTTTACAAAAAATCTAATAAATTTCCATACAACAAACCTTTGTATAATTTTGATATAGATGAAAGGGAATTGAGTGATATGTATTTAAGGAAGAAATCACGTTTGAACGCTCTGTCGAGGGGAAAGGTTAGGAATGAAGATTATTTTAAAAGTGACTTGGAGGATTATAACTCACTTGATTCAGACAGACGTCGCCATTACTATGACAAACTGAGTAACCTGCAGAGAAAAATCAGTTTGATGAAAAAACACAAACGTCAGAATTTCGTTTCTGGAGATCGTCATATAAGACCTGTCCGTCCACCCAAACATCCCAATTTAAGTCACAATAGACCAAAAGAAATGGAACTCTACATGCCTAACAGAGCAAGATTTTTACACGGATTTTAA

Protein sequence:

>DPOGS210264-PA
MLDNFGALFTQTGSHLRYTTDMYNTLRSSYLDTAETYKKLLGDFEQFNRSRNAASKASPLINLRPEGPRSPEERHAKTTAGASSSNQDQHSNDVINKTKEKIHNKLYRFVLPPEYDAYDTRWTLKYRTKLPGHVELMPQSGVYVSCGDLNHCQQVSKDCILLARRLLSQVFNRNAPSVCSSMSEKVLAFNNLGSNIRPDLDDHACSVLLNFVLEHGLQRGWNTDLQPILSTLHSKMQEIRFKYGAHFLTILSLALGADVYDQFNQEAVIVSEYCPSIPGCQDGTHVMCLHYHPESEMGPGCVHPVNVTMTPDLAQLIVDISNRIRSKIANGLGRGEEAVDLPKAYGMYRLSWDTELATLAQVWANQCIFSLDLCRSTRRFPTVGQVVGMSRYTLDTWMPISEDAAQIFNTSTLTPDKVRYAITYILKAWYETKNDVLPDVIEYFDIKSYSKHNQFVQLVSGPSTHVGCGISAYREYSYNNNNAALNYNSIQLVCNFSHQPRVDESVYQTKPPTEPGYTVKCGCPLGYDEDEDCLCYETERELPYSCKSTGRCKPSVVVLPIFTVEDAPIHSIGQNKSSQITPAVGPDVVLKRRINSKDYSRSQKNHKQAIKQDSTIKSRRINVDVKKRYFTERKGKKQSRLLNEHYRRHTPRQSIFAKAAVFELPSRLKQYGHKVNNYKKDVAPRRDFREVHTIVSKYLEGRKTGIPLRHSIERKNLTISRFTEIPSYTVIITSTPHTVNRFKKDKFNYGNLNATKMPRNGDTSIKPLNEDTDQMLMDLLDKLEEEVNNVHFNTKEKEFFDEKIRKIYGTLIKKPVVVEPKIVVPNTNVSLDYNVDNNLVENDMNGHNLGSDSNKANAYSYPQLEDRPINEKSYKKQKQNINEDKKDTVYKKSNKFPYNKPLYNFDIDERELSDMYLRKKSRLNALSRGKVRNEDYFKSDLEDYNSLDSDRRRHYYDKLSNLQRKISLMKKHKRQNFVSGDRHIRPVRPPKHPNLSHNRPKEMELYMPNRARFLHGF-
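
Length check:

The transcript and protein lengths reported in this record are consistent with each other: 3054 bp is 1018 codons, that is, 1017 amino acids plus one stop codon. This assumes the transcript is coding sequence only, which fits the nucleotide sequence above (it starts with ATG and ends with TAA). A minimal Python sketch of that arithmetic follows; the function name expected_protein_length is hypothetical and is not part of any geneset tooling.

```python
def expected_protein_length(transcript_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a coding-only transcript.

    Assumption: the transcript has no UTRs, so every base belongs to a codon.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    codons = transcript_bp // 3
    # The terminal stop codon is counted in the transcript but not in the protein.
    return codons - 1 if has_stop_codon else codons


# Lengths taken from this record: 3054 bp transcript, 1017 aa protein.
print(expected_protein_length(3054) == 1017)
```

A mismatch here would suggest untranslated regions in the transcript, a missing stop codon, or a truncated sequence.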