Monarch geneset OGS2.0

DPOGS215825
TranscriptDPOGS215825-TA1761 bp
ProteinDPOGS215825-PA586 aa
Genomic positionDPSCF300073 + 53985-60797
RNAseq coverage8x (Rank: top 86%)
Annotation
HeliconiusHMEL0170163e-11241.57% 
BombyxBGIBMGA010909-TA9e-13849.44% 
DrosophilaCG1399-PB2e-1125.50% 
EBI UniRef50UniRef50_E2BHS25e-3825.59%Uncharacterized protein C14orf166B n=3 Tax=Formicidae RepID=E2BHS2_HARSA
NCBI RefSeqXP_001623578.19e-2828.25%predicted protein [Nematostella vectensis]
NCBI nr blastpgi|3072069092e-3725.59%Uncharacterized protein C14orf166B [Harpegnathos saltator]
NCBI nr blastxgi|3838595146e-4027.09%PREDICTED: uncharacterized protein LOC100877744 [Megachile rotundata]
Group
KEGG pathwayecb:1000727882e-07 
 K12798 (NLRP1, CARD7)maps-> NOD-like receptor signaling pathway
Orthology groupMCL30793 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215825-TA
ATGTCATCTTCTTCAGAGGAATCAGAAGAAATGCCAGTTAAGAAGTTATATAGTCGCGATAACGATGATTTATCGTCATCAGAGCCACCAATGGAAGAATGGTCTTCTCTCATGTCTATAATTCCAATGAAGAGCCATAAAATGTCATTGTATGAGCAGGGTTTATTTAAACCAGGCAGCGATGAAATTTGCACTAAATATATACCATTGTCAGCGAGTTCCGTAATCAGACACCCTTACTACGCATATCCGGGAATAAAAGATCCGGGTATCAAGGAAGCGTTGTTGGACCCAGAACCAAGAAAAGTTTATCCCCTGGATGGACAAGAACTTTATTTAGATTTGTGTGAAGAAATGAAAGTAATGCCAGTACGGAGCTTCGTACGAGGTCTGCTCGAAGAAACCATTGATTTGAGATACTATGGGGTTAATCCTGTCGGAGTGCGGGCTATGAGCATGGCGCTTAACTGCAACCAGTACGTACGACGGTTAGACTTGACATCCAATTTTTTAAGCGAAGACGCTTGTTATCATCTTGGCCAGATGTTAAGAGAAAATGTTGCGCTACAAGAGCTTGTGTTTTGTGAATGCAGGCTTCAAGTAGAAAGTCTCCGTAAATTGGTGGTGAACTTATACTCGAGGTCTCTGGAGCTGTTAGATTTATCTCGGAACGACTTTGGGGACGATGGCTTCAAACATCTCGCCTATCAGTTATCTAGGGGGGCAATTATGAGAAAGTTAAATCTCAGCTATAATGGCCTGACTTCGGCTTCTGCGTCACTTTTTGCATCAGCGATTGAAGGAAATAACTGCATAACTCATTTGGATCTTTCTTGGAACAAAATGTCGGTACTCAAAGGCAAGCCAGGGATAGCTAGTCTAAAACAACTTAAACATCCACTCAGTGTCTCAGCGTTGTTCTCAAGCAGTAAACAACTGGGGAAGTTCAAAGTTTTCCCTTGCGGTGTTAACGAGTTACTCAAACAACTAAGCTGTAGTAAAGTCTTGATTGAACTCAACATGTCCTGGAATGCCGTAAAAATATCTAGAATCTTAAGGAAGTTGCTGACTGTACCTACGCTACGAATATTGGATCTCAGCAACAATAGAATCTCAAGGCAAGGAGTTACAGCCATTGTAAATAATCTTCAATCAGCCGTTAGCCTACACACTTTAGATTTGTCCTACAATCCAATAACATCTCGTGATGCTCTTTTGCTTCTTAGCAAATTACAAATAAAGTCTATTCGACTCGTCAACTTAATAATGGATAATATAGAAGTTAATAGAGATTTTGTGAAGGAGCGTGCTAGGATTCTGTCTCTGAAGTATCGCCGCAATTGCAAAATAACTTATGGACCTGTACGTCACAACTATGTATTGTCAACGCCTGATTTGCGAGAGATTTTACTGAAACGATTTGACTTCCTCACCTCCAGAGGCTCGAAAAGACATCAATTAGATATAGGACTATATTTTCTAGAGAAAAAACAGTTGGAAAATTTTATTCAACCACGTCAAGTAATGCGTGATATGAAAATAGCTGGGATATCAGTAGACAACGAACTGATTGATGGTGTGGCTGACATGTTTCCTGGTCCGAAATTGGAGAAGGGAGGAAAAACAATGGATCTAGTAGGTATAACAGAAATAGTGATGCGTTTGTGGCCTGAAAAAAAAATACCACCAAAGCCAGAACCGGGACCGGAAGAGAAGAATGTAAAAGAGGGACGGAGAGGAAAAAAGAAAAAGAAGAAGTAA

Protein sequence:

>DPOGS215825-PA
MSSSSEESEEMPVKKLYSRDNDDLSSSEPPMEEWSSLMSIIPMKSHKMSLYEQGLFKPGSDEICTKYIPLSASSVIRHPYYAYPGIKDPGIKEALLDPEPRKVYPLDGQELYLDLCEEMKVMPVRSFVRGLLEETIDLRYYGVNPVGVRAMSMALNCNQYVRRLDLTSNFLSEDACYHLGQMLRENVALQELVFCECRLQVESLRKLVVNLYSRSLELLDLSRNDFGDDGFKHLAYQLSRGAIMRKLNLSYNGLTSASASLFASAIEGNNCITHLDLSWNKMSVLKGKPGIASLKQLKHPLSVSALFSSSKQLGKFKVFPCGVNELLKQLSCSKVLIELNMSWNAVKISRILRKLLTVPTLRILDLSNNRISRQGVTAIVNNLQSAVSLHTLDLSYNPITSRDALLLLSKLQIKSIRLVNLIMDNIEVNRDFVKERARILSLKYRRNCKITYGPVRHNYVLSTPDLREILLKRFDFLTSRGSKRHQLDIGLYFLEKKQLENFIQPRQVMRDMKIAGISVDNELIDGVADMFPGPKLEKGGKTMDLVGITEIVMRLWPEKKIPPKPEPGPEEKNVKEGRRGKKKKKK-