Monarch geneset OGS2.0

DPOGS207483
Transcript: DPOGS207483-TA, 4827 bp
Protein: DPOGS207483-PA, 1608 aa
Genomic position: DPSCF300051 + 454415-470246
RNAseq coverage: 1673x (Rank: top 8%)
Annotation
Heliconius: HMEL014857, 0.0, 60.06%
Bombyx: BGIBMGA000957-TA, 0.0, 54.21%
Drosophila: CG10732-PA, 3e-12, 94.29%
EBI UniRef50: UniRef50_D2A1X2, 3e-29, 29.30%, Putative uncharacterized protein GLEAN_07786 n=1 Tax=Tribolium castaneum RepID=D2A1X2_TRICA
NCBI RefSeq: XP_001842447.1, 4e-19, 45.39%, conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp: gi|270005688, 1e-28, 29.30%, hypothetical protein TcasGA2_TC007786 [Tribolium castaneum]
NCBI nr blastx: gi|157124788, 3e-29, 25.00%, hypothetical protein AaeL_AAEL009976 [Aedes aegypti]
Group
KEGG pathway 
InterPro domain: [1293-1342] IPR024138, 6.7e-17, Pericentriolar material 1 protein
Orthology group: MCL26660, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207483-TA
ATGGCACCTGGTAATAAAGAGAGGCATCCAGGACACACGGGTACTATACCCAAAACGAAAGTGCGAAACAATTGCAATGTGCCGCAGGAGGCTGTAACACCAACACGACGCGACATGGCCACTAATATGGTGACGTCTCGGAAACAAACAAACAATACCTCAGATCAGGAATTGACTGATATAGACAATTCAGCTTCATACGAAGGAGGTTTTGGTCGTTTTGTGCCAGTTCGCCCTCAACCGCGTGCTCTGCCGCTGACAGTGCAGAACAGGGAGGGTCTATCGGATCAGCATGATGATGCAGCTTCCACGAGCTCACATACATTGCATCCATACGCTCATCAAATGCAGAATGGTTTAATACCACATTCAAACCACGCGGCTGCGGCTGTTATGATAGTTAATCCGACAACTCCGAATGCGGTCACAGGGAATATGAACCAGCAGATAAATACACCAAATCAGAACAGGAACAATCTGACAAGGAATTTGAATTCGCGTAACACGTTCGATGGTAACGCCAATAACAACACACCAGCCGTAGCTCACAGCAGGTCCGGGACGAAGTCTGTCAACCTCATGAACAATATGTACGACCAGGGACAAGGCAATTTCAATAACGTCAACGAGAATCACTCTCACAGGGAGTCAGCCCGTGCGTTGGGCGAGGCTATAGTGGCGGCGTGTCCTGACGCAGCGGCGGTCGAGAGAAGACTGGGACAGATACGAGAGTACATACGAGTGACGTCATCACTCGTGGACACCATGCGGAACTCTGAGGATGAGGCTTTTATTGAGCACACAAAAGAAGAATACGATGAGCTAGTGAAAATGGTGATGCAGTTGAAAGAGAGCGAAACAAAACTCGAAGCTCTATTGGTTAACAACGACCCAGGCGAAGCTGTGACCCAAAACGAGAACACTACTTCGGAGCGAACGGAAACCGCTGACGATAATACCGAAGTTGAAAAAGACGTTGATCAAGATATACAAACAGATAAAAATAACGTAAAAAACAACGAAAACCAAACGAACAACCTCAAAAAAGTATCAAACATAGAAAATGAACATCTGAACAGGTCCATTGCGAACGTTATAGAAAAACCTCACGTGCAAGAAGAGTTCATGACGCAAGATGATGAGGAGGTCGCTCTCGCCGGAAAACTCGGCAAAGAAACTGAAACAGCGATATCCGGGAGTGACATCAAGAAAGACGATCAAGAACTCAACGACATACTGAGGAAGAATATAATGGCGTGCGTGGAAAAGATCGCCGTCATAGAAGACATGAAAAATAAGAGAGATTTTAACAACAACGAGCAGGTTTCTGTGGATCAGGATGTCGTAGTGAGACCGGTTTCGGTACAAAGCAACGGATCCGCGTACAGCGACAATGAACTCTGGCAGAGTGAAATAAAAAACAAAATGGAATTGTCACAGATAAGATTGACGGCTCTCAAACAACAGCAGAAGAGATTGCTTAAATATCAGGCTGAAGCCAAGCAGCAGTTAGAAGAACTGAATAAGGAGCGCCAAACCCAGGAGATACATCATCCATCGACATCACACGACAACTTCGCTGGAATGCAGAACATGACAATAAATGCAAATTATCACCCACAAAACCGTATGTACGCCACGAACCCAATCGGGGTGGCCACGACTCTGCCGCAGGCGGACATGACGTCACAGATGACGTCACACATGATAGACAACGACCGGGCGCACGCCATTCACGATGGTTTCCTCCTAAGGGATGGCAACGCTGGCAGTGAGGAAGCTCTGTGTTCGGAAGACGAGGCAGCCGGAGGACGGCTGAGGGAACAACTACGGGGCCTCCGAGCTAAGAAGGTTCACATGGAAAATCTGGTTTCCGAGTTTCAAAAACTTCAAATGGTGACACGTCTCGGCAGCTCTCACAACGCGTACTCGAGTTCAGGCGAAGACAGCGCTGAAGACGAACCGTCCCATAAATGTAACAAATCAGGAGTCAACGTGAA
AAATGACGACCCATTCAAACTCATGGAAATAAAAGCTATAAAGACAGCTTTACAAAAGACTAAAGATCTGATGAGGTCAGTGGAGACGTTTGAAGCTGATCACCTCTCCGGAGTCAACGAGGAGACCTTCCATGCGCCGAACCATTCCAACATACACGACAATAAATCTGAAGGCGGCGGCAGTATGGGCGATTGGTCTAACGACGCAAACGAAGGAAGCGTTTGTCGCGTTAGAAACGCTATCGCTAGGCACAAGTTCACTAACGAGCACCAACAACATCAGCAACAGCATCAACAGCAACAGCTCCATCAACAACAGCGGCAACAACAACAACAGCTGCAACACCAACAACAGTTTGTACAACAACAGCATATGCAACACATGCAATCACAGCAGTTACAACAACAAAGCCACGAAGCTAACATCAACTCTCTACAAGAACTTGCTCAAGAGCTTCGAATGGAAACTGAGAAGATAATGGGAGAAAGAGCGAGGATTAAGGACATTGTTACTAATAAAGAACGCAAACAAAAGAAGGTTAACGAGGAGGTGGTACGTGGTACGGTGAATGGTGTCGGTGGTGGGAGCAGTAGTGGTTTGGGCCCAGCTGAACGACGGCAGATGCAGCTCAGAGCGTTGCTGGCTGACAAACAAAGGGAATTAGAAGCGCTACTTAACAAGAAGGGAGACAGCGCGTGCTCTCGCTCGCTGCAGGCGGCCGCGTCACAGCTCGCGAGCGCTTACGCCAGCGACAGCGACACAGCGTCTAGAAACAAGATCAATCAGAGAAATCGTTCACGTCGAAGGACACAGGAGCGACCATCGGATGAGAATAATGTGTCCAGCGGCAACTCGGAGGCTCAAATCATTCTCAAAACTGACTCTGGACCTGGATCCGTGTTGAACGGATCTTACAATAGCGAACAGGAGGAGCAATCTCAGTCCGAACATGTACAGGTCGACTCGCAGAATTCTAGAGAGAACGAAAATTACAGGGACGTCCGTACAAATCAGTTCACTCAGAACAGCCACTCTCAGACCATGGAGCGCCTCCTGGGTACGCCCGTGGGTTTCCCGCCCGCCATGCCCTTCAACATGATGATGCCGTTTGGTATGATGTGCGCGTGCGGGTGTGCGTGCAGTGCGTGGGGCGCCTGGAGCTGGCAGCAGCTGGCGGCGCAGGCGAGGGACATACACGCGCTCAGGGAGCAGATCACCCATTTGGAGGAGCGTTGGCGAGCCGAGACGGCGTCACACACCCTCAATAACCAAGTACCGCCCGGGAACAGAGCGAACAACTACTGGGATAACTTTAGAAGCTATTCCCGCCAGAACTTACTGTCCACGAACAAGAGCAACGAAGGTCATGTGGGTCACGCGTTGGTCGAGCGTTCGCATAACAGTTTGCACCACACGTCGTCGTCCACTTCTCCGTCTTTAACACCAAAGCGGAATACAGACGCTCCGCCCACCCAGCAGGTTGCTCCGCCCACCCACGGCCCCGCCCATGGCCCCGCTCGAGGCTACGAACGCTCTCTATCCTTCTCCTCCGCCCCCGACGTCCTCAACGTGAACCAGAACCCGGAGGAAGACTCCAACCTGAACGTTAACCGTCCCGAACCTCCTCCCCAGAGAAACATCAACTACGCTAACCCGATACCGGAACTTATTCAATCCCACTCGAACAATCTGAACGCGCCAGCCAACAAACGGAAGTCGTCAGCTAAACTTCCCGCCAAAAACGCTGCGAATAAAAAATACGCCCTCGTGTTCTCACCGACAAGCAATATAGATATAGCGAGCTCGTCCGTCGATACAGCGAATCCGAATTTGGCGTCCGGCAGCCGTGAGGTACAGGACAAGGCGACTTCCAAGCTGTTCGATCTGCTCAGAGAAAATATTTACTCCGAAGTGACGACTTTGATTGGAGTGAACGAGTCACATCCGGATTTCCTGATACAATTGTTCCGGGAGCTGCAGCTGATCAGCTCGGACCCTC
TCAGGCAGCGTGTGCTACAGTCGATACGAGGCGTCCTGGCACAGTACGGACCCTTGATGGACAATCAGAATGACGAACCGGAAGAATTGACCACCGAAGTGACCACCACCTCCGCAGAAACCACCTGCCAGGATTGTAACAGTCAAAACGCATGTTCCAGCAGCGTACAGATCGATAATAAAATCGTACGATTCCTCTTACTGAAGAGCGAAGAAGTATTTACAACAGATTTGCTCGAATCTCTCTCTTCACTCATAATAAACCAGCCGCCTGACGGAAGGCGGAGCAAGAAGCTCCTAGACATGCTCTCGAAATACGAAGGTCTGAGAGTCTGCGACGTGTCTAGTGACATAATAGAAAACATGACTCTGTTCACGTCAGTGAACAACGAAGGAGACGAAGCGTCCCGACACGCGAACGAAATCTCCGAATCTGCGATACTGCAGGACGTAGCCGGGTCATTGAACTTGTTCAGTTCGAGCGAGACTCAGCTACATCAGATTAACACGTGTCCGTACATGTGGCCCGGTCACGTACACGTGGACATTAACGACAGTCCCCGCTCGAGTCAAGAAGGGAAGGTCGATCTCATCAACTGCTCGGATATGCACAATGGCGACTTGGCCGAAGCCGATCAGACTTGCCGCACCGACATAGAAGACGGACAAACACATAACGAACTGGCCGGAAATGACGCTCTACCGGATGTCGTTGACACGGACGAGGTGACGCCCACAGAGGCCGAAGCTGAGTGGTTCGGGCTTGACCGTGTGCCAACAAGGCTGCATATGGAAGAAACATCTGAAAAGCAGAAAGGTGAAGTTTAA

Protein sequence:

>DPOGS207483-PA
MAPGNKERHPGHTGTIPKTKVRNNCNVPQEAVTPTRRDMATNMVTSRKQTNNTSDQELTDIDNSASYEGGFGRFVPVRPQPRALPLTVQNREGLSDQHDDAASTSSHTLHPYAHQMQNGLIPHSNHAAAAVMIVNPTTPNAVTGNMNQQINTPNQNRNNLTRNLNSRNTFDGNANNNTPAVAHSRSGTKSVNLMNNMYDQGQGNFNNVNENHSHRESARALGEAIVAACPDAAAVERRLGQIREYIRVTSSLVDTMRNSEDEAFIEHTKEEYDELVKMVMQLKESETKLEALLVNNDPGEAVTQNENTTSERTETADDNTEVEKDVDQDIQTDKNNVKNNENQTNNLKKVSNIENEHLNRSIANVIEKPHVQEEFMTQDDEEVALAGKLGKETETAISGSDIKKDDQELNDILRKNIMACVEKIAVIEDMKNKRDFNNNEQVSVDQDVVVRPVSVQSNGSAYSDNELWQSEIKNKMELSQIRLTALKQQQKRLLKYQAEAKQQLEELNKERQTQEIHHPSTSHDNFAGMQNMTINANYHPQNRMYATNPIGVATTLPQADMTSQMTSHMIDNDRAHAIHDGFLLRDGNAGSEEALCSEDEAAGGRLREQLRGLRAKKVHMENLVSEFQKLQMVTRLGSSHNAYSSSGEDSAEDEPSHKCNKSGVNVKNDDPFKLMEIKAIKTALQKTKDLMRSVETFEADHLSGVNEETFHAPNHSNIHDNKSEGGGSMGDWSNDANEGSVCRVRNAIARHKFTNEHQQHQQQHQQQQLHQQQRQQQQQLQHQQQFVQQQHMQHMQSQQLQQQSHEANINSLQELAQELRMETEKIMGERARIKDIVTNKERKQKKVNEEVVRGTVNGVGGGSSSGLGPAERRQMQLRALLADKQRELEALLNKKGDSACSRSLQAAASQLASAYASDSDTASRNKINQRNRSRRRTQERPSDENNVSSGNSEAQIILKTDSGPGSVLNGSYNSEQEEQSQSEHVQVDSQNSRENENYRDVRTNQFTQNSHSQTMERLLGTPVGFPPAMPFNMMMPFGMMCACGCACSAWGAWSWQQLAAQARDIHALREQITHLEERWRAETASHTLNNQVPPGNRANNYWDNFRSYSRQNLLSTNKSNEGHVGHALVERSHNSLHHTSSSTSPSLTPKRNTDAPPTQQVAPPTHGPAHGPARGYERSLSFSSAPDVLNVNQNPEEDSNLNVNRPEPPPQRNINYANPIPELIQSHSNNLNAPANKRKSSAKLPAKNAANKKYALVFSPTSNIDIASSSVDTANPNLASGSREVQDKATSKLFDLLRENIYSEVTTLIGVNESHPDFLIQLFRELQLISSDPLRQRVLQSIRGVLAQYGPLMDNQNDEPEELTTEVTTTSAETTCQDCNSQNACSSSVQIDNKIVRFLLLKSEEVFTTDLLESLSSLIINQPPDGRRSKKLLDMLSKYEGLRVCDVSSDIIENMTLFTSVNNEGDEASRHANEISESAILQDVAGSLNLFSSSETQLHQINTCPYMWPGHVHVDINDSPRSSQEGKVDLINCSDMHNGDLAEADQTCRTDIEDGQTHNELAGNDALPDVVDTDEVTPTEAEAEWFGLDRVPTRLHMEETSEKQKGEV-
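
The lengths reported for this record can be cross-checked against each other. The following is a minimal sketch, assuming the transcript sequence shown is coding sequence only (it starts with ATG and ends with the stop codon TAA) and that the trailing "-" in the protein sequence marks that stop. The helper names `parse_fasta` and `expected_cds_length` are hypothetical and are not part of any database tooling.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a dict of {record id: sequence}.

    Sequence lines belonging to one record are concatenated, so a sequence
    that is wrapped over several lines is returned as a single string.
    """
    records = {}
    current_id = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record id is the first whitespace-delimited token after ">".
            current_id = line[1:].split()[0]
            records[current_id] = []
        elif current_id is not None:
            records[current_id].append(line)
    return {rid: "".join(parts) for rid, parts in records.items()}


def expected_cds_length(protein_length_aa, has_stop_codon=True):
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    codons = protein_length_aa + (1 if has_stop_codon else 0)
    return 3 * codons


# Lengths as reported in the record: 4827 bp transcript, 1608 aa protein.
TRANSCRIPT_BP = 4827
PROTEIN_AA = 1608

# 3 * (1608 + 1) = 4827, so the two reported lengths agree.
print(expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP)  # prints True
```

To check the sequences themselves, the same comparison could be run on `len(parse_fasta(text)["DPOGS207483-TA"])` after pasting both FASTA records into `text`, with the protein length taken after stripping the trailing "-".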