Monarch geneset OGS2.0

DPOGS200553
Transcript         DPOGS200553-TA   5238 bp
Protein            DPOGS200553-PA   1745 aa
Genomic position   DPSCF300119 + 100542-112732
RNAseq coverage    40x (Rank: top 72%)
Annotation
Heliconius       HMEL016866         0.0     49.76%
Bombyx           BGIBMGA010769-TA   0.0     42.76%
Drosophila       dtr-PA             2e-74   54.03%
EBI UniRef50     UniRef50_C3ZYS5    3e-93   49.31%   Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZYS5_BRAFL
NCBI RefSeq      XP_001810468.1     4e-90   63.64%   PREDICTED: similar to leucine rich repeat containing 50 [Tribolium castaneum]
NCBI nr blastp   gi|260782448       9e-93   49.31%   hypothetical protein BRAFLDRAFT_82905 [Branchiostoma floridae]
NCBI nr blastx   gi|260782448       1e-94   38.26%   hypothetical protein BRAFLDRAFT_82905 [Branchiostoma floridae]
Group
KEGG pathway 
Orthology group    MCL25771 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200553-TA
ATGGCTACGCCGACTGTTCAAGTGGATGATGAGTATAGGAAAGTCAAACGTGCTGAACTGGAAGAGTTTTTTAAAGATCTGGATATAAAGAAAAAAGAGTTGGATAAGATAATAGAAAATGAGTTTGAGGTATTGAATGAAACGGTACAGAACCAGGATAGACCATACATCGCCGCATCAGCTCCAGGCACCGTTAGAACTGAGTATGAAGTGATGGCAGGTCAACCATTCACGGAAATTGGAGACCACTATAAACCATTATCCGACGAGGAAATTAAAAATGTGACTGAAAATAATCCAATTTTAGCTGAAAAAGCAAAGCAAATCCAAATTGTGATTAACTCGGATCGTAATAATAGAGACATGATGATACCATTAAATCAAATAAATATTTCTAATTATGATGAAATAAAAGCTGAATTGAAATCAGCTGCAGGTGATCCAGACCAAATAAATAGGATAAAATTCAAACACATGAAGGCTTTATTTAATTCAGCAGAAAATTACAAAAACGATATGACTAATCAGAAGCAGATTGATAAAGAAAAAGACGAAGAAAATATGATAGACGATTCTTCAAGTGATGAAAAAGAATTTGAAGCTATTGATAAATGTGTAACAGAATGTACAAATAAATCCTATGAAACAAGGTTTCAGGAGACTAAACAGATGCTGCTGAGAGTAGCTGATAAATATGACGATCCCAATATATCAGAGAGCGAGAAGAGTAAGCCTAACGAAGTAGAAAGTTACCCGATTCTTAAGAAATCTTCATCATGTTTAATTAAAGCGCCAAAGCAATGCTCTATTTTGCAAAAAGTTAAAGAAACATATGCGATTAATAAAGCTATGAATATACCTCTACAAGACAACCCGGTAAAACTAAACCTTAGTCGAAAACCAGAGGAAAATGTCAAGAAACAATTTGACATAAAACATTCAGCAATGCTAGAAAAACTTACAGATACTAAAGAAGCCCTCTTAAAAATTGATAATATTTTGAGCGATTCAATTCCTGACGAGGATAAAGATGATAATATGAAAGATTCTGTTAATATCATCGAAATTGAGAACGCAAACCATATTGTTGAAAATAATGAGGTTCCAAGTAAAGATTCCCAAGAAAAATTCGATGAGAAAATGGAAGTAACACTACAAAATGCACTGGAAAATATTTTTGAAATAGGCAAGAACGAAAATAAGGAAAATAATGAATTAGAATTCAAAGAAATGAAAAGTTTGGCTCGAAACATTGTTGAAGGTGCTGAAAATCTCAGCACTCTTATAAGAGAAGATATAACTAACAAACTTAACACCATGAATGAACTCTTAAATGATGTCAATGAAGCATTGGAAAATTCTAGGAAATCTAATATCGCATACCAGAAGCTTAAAGAAGAAGGAGAGGTCCGCAAACGGGTTAAAGAAATAGAAGGAAGTAGAGAGGGACATAACTCTGATTTGAGTTCTGTATCCGTTGTGGATATTGATAATATAAATCAGGCAATTTCAAAAATTAACACTGAAATAGAATGCCAAGAATCCAGAGTTAGCACTAGTCGAACAAACTATGAAGCGAGGAGTAAAGAGTGTCAAGATTTCATTAAAGAAGTGGATGTGATTTTAGAAAAATCTCACAAAATTCTACACCCCTGCGATTCCAAACTTCATGAAGATAATAATATTACAGGCGATTCTGTAAAGAGTAATGTTGGTGACCGTAAGGAGTTGTGGGACGTAGATTTAAAGATAAATAAAACCAAAATAGACAACATACAGAAGCAAAATATGGAAAGGAACAAGAGAATTGATAACCTATTGTACGATATTAAGGATAAAATGAAGGATAATAAAGAAGTATTAAGACTAGCAAATAATTTACTGAGGCGAGAAGAGGGCAAGAAGAAGTTACAAGAGAAAAAAGCATCTGAGACACCGCATATTGAACATGATGATAGAGCTATGGGAGATCATGTTGGAGTTGAAGAAAATAAGGATGA
TAACACGAAAATTCCTTCGTCTGACATTCAGATTACAGTCCAAAAAGAAGAAGATATAAAGATGGAACAGACTGAAAAAGAGAAGCAAAGAGAATTCCAATTAAAGCTTGAAAAGGAAATAGAAGACAAAGGTCCCAGGATGACAAAACAGTTTATAAAAAATCATTGCAAACAACACAAGTTGTATTCTACACCGTACCTCAATGACATTTTATATTTACATTTTAAGGGTTTTTCGAAAATTGAAAATTTGGAAGAATATACTGGTTTGAAATGCATTTTCTTGGAAAATAATGGTATCCAACGCATTGAGGGTTTGGATACATTATCGGAATTGAAATGTCTTTACCTTCACTATAATGTCGTTAGAAAAATCGAGAATCTCCAAGGATGTCCTAAACTTGATACACTTAATTTGGATCACAACTATGTTAAGAAGATTGAAAATTTAGACGTCGTTCCTGACTTGCATACTTTAAGTCTTGGCCACAACATGTTGGCCACCGTCGAGGACTTGGAGTCATTGAGATTATGCAACAATTTGTCGGTATTAGATCTATCTTATAATCGTTTAGAGGATCCTCTTATTGTGGACGTATTGGCTGACATGGCGCTTTTAAAAGTGCTAGTCCTAACCGGTAACCCTGTCGTCCGTAACATTCCCGCCTATCGCAAAACGTTGACTCTCCGTCTCAAGGAGCTCCTTAACCTGGACAACCGGCCCGTGTTCCCTCGAGACAGAGCCTGCGCTGAAGCCTGGCAACGAGGAGGAGTCCAGGAGGAGATAGCGGAGAGACGCCGTTGGATAGCCAAAGATCAAGAGAAGGTGATGCAGAGTGTCCGCTACCTTATAAAGATGAGAGACGAGAATAAACAAAAGAGAGAGGCTCGGGAACGGGAAGAGAGAGAGAGTCTCGGCCTGCCACCAGTCGTGCTCGAGGATGGCACCATCGATCCTAAGGAGGCAAATGCCGATGAGTTTGTCGATGATACTAAAGTAAACGAAACTGAAAAGAAAGACCAGTCTGGTGTCGTTTCGGACATGCTCTCCGGGTCCGAGGCTGAAGACAGCACGAGCGATGAAAGTGATTCAGACAGCGATTCCAGTGAAAACAAAACAGAGATGGGAAAAATAGAATGGTCTCAGGTTGATCGCGGCAAACATCTCATACAAGAATTGAAGGAAGAACAGACACCAGAGGAACAGTGGTCGGGTTTCGGGTTAACTTCCACCGCAGACATGAAGACTTCAAGCGAGTTACAAGCTATCAGTAATTTGTTATTCAATGAACCATCTCACGTTGAACCTAAGAGAGTGTCACAGACATTACAAGAATCGAGAGAAATATTAAAGGAGGCATTCAACACTGGTGAAGTAACTGAAGAAGAAATGAAAACTGATGCGAGACGAAAACCACTTATTGAGATATTGGAAGAATATAACAAGAGACCAAAGAAAACGGAGGAAGAATGCGTTCCAGAAGTTATACATTTAAGCGGAATTACAGAAGAAGGTGACTTGATAGTAGACCATGATAAGAAGCTGGCATATGAAAGACAAAACATCGTATCACCGCCAACGACTGGGGTGTTTAAAAAAGAAGGAAGTATCAGAAAAATTAAAAAAATGAAGAAAGTTACCATTAAGCAAGTAATAGATGAGGAACCAGAAGACAATGAAAATAAAGAAGAGAATGATAAAAAAGATGTTGAAAATAAAGAAAAACAAAACGAGACAGATGTTACAACTACTTCCACACAAAATATTAGTGAACGAAGAAGAAGCATCATCCCCAGTGAAGGTCACGGTACAGCTTTTCTTAATTACATGAAACATATGAATAAGGATCCTGATTCGAAGGAAAGTGATGATGAAGATTTGAAGCCAAGCGCCGAAGACGAGGAAATATTTAAGGAACTCGAGAAGGAGCAGGCGGAGAGAGAAGCTAGGATCGCTGCGGGGCAACCAGCGGTCGACCCCATGAAGTTGTATGACGCAG
CCACCATGGAGGCCTACTACAAACAGCTGGAGGCTCCTCCGGCGCACGCTCGGGTGACCAGGTCTCATGTCACTGAATACAGGCATGATAACGTGTTTGATCGGATTGCCTTGAGCCAACTCACCGGGGGAGAGAGACCAGCCACCATGGAAGCCTATTACAAACAGCTGGAGGCTCCTCCAGCGCACGCTCGGGTGACCAGGTCTCATGTCACTGAATACAGGCACGATAACGTTTTCGATAGGATTGCCTTGAGCCAACTCACCGGGGGAGAGAGACCAGATGAGAATAAAGCAAAATTAACTCACGTCCCCGGAGCGGTCTTGTTCTCGTACGTCAACAAACAAACACCCGTCGAAGTAAATTTTGAGATAGGAGAAGAAGTCCTAGATTCAGAAGCTTCAACCGCTGGCACGGAGTCGATACACATTGACAGTGACAGTGATAGTGATGTGCCTGACACTCCAGACTACAAGGATATGAAACTGAGACGACCGACATCAGCTGGAGTTAGACGAGAAGAAAACAAAAACACCAATAATAATGTTCCACACACTAACGAACATGACCCTGCAAACAAATCCGTTCTGACCAACGATATGATGACAAACGACAGTAAAGAGCGGTGTGAAGCCAAAAGATGCATCATAGACAAGATCAACTCCTACGAAGATAAAAGATTCCCGTCACAAGGCGTGACGGCGGACGTGGCAGAGAACGCTCGCATCGAAGACTCGGTGGCCACGGAGTTGTTGAACAAAACTCTCGAGATGGAAGAGCAGGAACTGTACCGACACATCGACGCCGTCAACAACCACGCGGGCAGAGTCGACAACAGAACCAACTCCATCATCGAGGAGATCTCGGACCAGTTACACCAGGAGTACACGGTAGAAGTCGGTCGCTTCCATGACACCATTAGTTGCCAGCACTTGGCGGAAGTGTCCGAAATATTGGAGACACATGTCGAGGAAGTGGAACAGAGAAGGAGGGAGAGTGCCATATACGAAGACGAGGAGACACCCCCACAGGAGCTCGACACGACGCTAGTGGGGGATGTGTCCAACGATACACTGGTTGCAACGGGAGTAGGAAGCGATTCCATGGTTGACGGGGATGAAGAGCTCGGCCACGCCGGGGCCAGCAACGTGTCGGCCGAGGACGACGTGTTCGAAGACTGCATCGAGGAGCTGCCCGACCACTACACGCTCGAGATGAAGCTGGCGCTCACTGATTGA

Protein sequence:

>DPOGS200553-PA
MATPTVQVDDEYRKVKRAELEEFFKDLDIKKKELDKIIENEFEVLNETVQNQDRPYIAASAPGTVRTEYEVMAGQPFTEIGDHYKPLSDEEIKNVTENNPILAEKAKQIQIVINSDRNNRDMMIPLNQINISNYDEIKAELKSAAGDPDQINRIKFKHMKALFNSAENYKNDMTNQKQIDKEKDEENMIDDSSSDEKEFEAIDKCVTECTNKSYETRFQETKQMLLRVADKYDDPNISESEKSKPNEVESYPILKKSSSCLIKAPKQCSILQKVKETYAINKAMNIPLQDNPVKLNLSRKPEENVKKQFDIKHSAMLEKLTDTKEALLKIDNILSDSIPDEDKDDNMKDSVNIIEIENANHIVENNEVPSKDSQEKFDEKMEVTLQNALENIFEIGKNENKENNELEFKEMKSLARNIVEGAENLSTLIREDITNKLNTMNELLNDVNEALENSRKSNIAYQKLKEEGEVRKRVKEIEGSREGHNSDLSSVSVVDIDNINQAISKINTEIECQESRVSTSRTNYEARSKECQDFIKEVDVILEKSHKILHPCDSKLHEDNNITGDSVKSNVGDRKELWDVDLKINKTKIDNIQKQNMERNKRIDNLLYDIKDKMKDNKEVLRLANNLLRREEGKKKLQEKKASETPHIEHDDRAMGDHVGVEENKDDNTKIPSSDIQITVQKEEDIKMEQTEKEKQREFQLKLEKEIEDKGPRMTKQFIKNHCKQHKLYSTPYLNDILYLHFKGFSKIENLEEYTGLKCIFLENNGIQRIEGLDTLSELKCLYLHYNVVRKIENLQGCPKLDTLNLDHNYVKKIENLDVVPDLHTLSLGHNMLATVEDLESLRLCNNLSVLDLSYNRLEDPLIVDVLADMALLKVLVLTGNPVVRNIPAYRKTLTLRLKELLNLDNRPVFPRDRACAEAWQRGGVQEEIAERRRWIAKDQEKVMQSVRYLIKMRDENKQKREAREREERESLGLPPVVLEDGTIDPKEANADEFVDDTKVNETEKKDQSGVVSDMLSGSEAEDSTSDESDSDSDSSENKTEMGKIEWSQVDRGKHLIQELKEEQTPEEQWSGFGLTSTADMKTSSELQAISNLLFNEPSHVEPKRVSQTLQESREILKEAFNTGEVTEEEMKTDARRKPLIEILEEYNKRPKKTEEECVPEVIHLSGITEEGDLIVDHDKKLAYERQNIVSPPTTGVFKKEGSIRKIKKMKKVTIKQVIDEEPEDNENKEENDKKDVENKEKQNETDVTTTSTQNISERRRSIIPSEGHGTAFLNYMKHMNKDPDSKESDDEDLKPSAEDEEIFKELEKEQAEREARIAAGQPAVDPMKLYDAATMEAYYKQLEAPPAHARVTRSHVTEYRHDNVFDRIALSQLTGGERPATMEAYYKQLEAPPAHARVTRSHVTEYRHDNVFDRIALSQLTGGERPDENKAKLTHVPGAVLFSYVNKQTPVEVNFEIGEEVLDSEASTAGTESIHIDSDSDSDVPDTPDYKDMKLRRPTSAGVRREENKNTNNNVPHTNEHDPANKSVLTNDMMTNDSKERCEAKRCIIDKINSYEDKRFPSQGVTADVAENARIEDSVATELLNKTLEMEEQELYRHIDAVNNHAGRVDNRTNSIIEEISDQLHQEYTVEVGRFHDTISCQHLAEVSEILETHVEEVEQRRRESAIYEDEETPPQELDTTLVGDVSNDTLVATGVGSDSMVDGDEELGHAGASNVSAEDDVFEDCIEELPDHYTLEMKLALTD-