Monarch geneset OGS2.0

DPOGS208594
TranscriptDPOGS208594-TA3642 bp
ProteinDPOGS208594-PA1213 aa
Genomic positionDPSCF300052 - 429960-442567
RNAseq coverage26x (Rank: top 77%)
Annotation
HeliconiusHMEL0165910.062.29% 
BombyxBGIBMGA013376-TA0.056.95% 
DrosophilaCG33978-PA6e-3049.52% 
EBI UniRef50UniRef50_D6WYX04e-13443.13%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WYX0_TRICA
NCBI RefSeqXP_001810762.19e-13542.97%PREDICTED: similar to CG33978 CG33978-PA [Tribolium castaneum]
NCBI nr blastpgi|2700125701e-13343.13%hypothetical protein TcasGA2_TC006727 [Tribolium castaneum]
NCBI nr blastxgi|2700125701e-13242.97%hypothetical protein TcasGA2_TC006727 [Tribolium castaneum]
Group
Gene OntologyGO:00055091.6e-07calcium ion binding
KEGG pathwaydpo:Dpse_GA285288e-07 
 K02599 (NOTCH)maps-> Dorso-ventral axis formation
    Notch signaling pathway
InterPro domain[817-851] IPR0130911.4e-08EGF calcium-binding
[817-865] IPR0018811.6e-07EGF-like calcium-binding
[706-796] IPR0000826.6e-06SEA
Orthology groupMCL16585 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208594-TA
ATGGATGGATCTATTCCACATTCCACAACCGCCTCGGGAGCATTTAATACAAATGCTATAGTCTATGACAACTCGCCATGGAGACCGGGAAGAGATGTGGAACCGGAAAGAGGAGAAGATGAAAGGAAACTAAAAAATCTAGCTACCAGAATTATGTCCAACGGAGTAGAGGTTTTGGTTAAGGACAGAAGCGCAGAGGAGAAACAGGAACAGACAAGATACTCGGACGTGAAAACAATACAACCTTCAGCAGTTAGCAACAGTTTCGTAAATAATAATCCTAAAATTGATAAGAGAATTGACGATGATCATAATCATTTTGATCTGATAGAGCCGTCAGAGATGAGTCATACTTCATGCTCGACCTCTTGTACGTGCAGTCATATACAAACAGAAAGTTTAAAAACCAAATACACAAATGTTAAATCAAAGAGCACGAAATATAGCGGAACGAAAATTGATGTACCGCCTACCCATCTTTCATCTTACGATCATACAAAGTATCATCCTTACGAATATGAAACTAGTTCGCATGGACCACCCACTGATAAAGTTGTAGAAAAATACACACAAGTTACTGATTACTCTGATGAAAACGATAGTAATTCATTGGACATGAGCGATTTGCTTACTAATGAACAAGCTCCTGAACCACCGAAAACTAAAAGCACAACATCAAAACCACCAGAAAAAAGCAAACCTGATAAGCCTCAGAAAGTTAATAAAAACAAGTTTGTTGTGGCTGAACTTATAAAATTAGGTTCACTGGGAATAAAAGGTTTATCACAATTAGCACCTGTGATCGAAAAAATGACCGGTGGATTCATGAGACGGCAGGAAACAAATAGAACGACTTCTACCACAACTACAGTGAAACCGATAAATAAAGTAGTAGGTTACAGTGCTAATAAGAGAGTGGATAATGAGGTAGAATCTAAACACAATAATTTTCCAATATATATTCCTGTTGATGAAATGGAAATGGCAGAATCTCAGATTGGTTACACTAATGTTACATTGCAACAAAATATTGCATGGGCTGCAGACCATAAAAATTCTAAAGTAAATCTTATGAAATCTAAAATAGTGCATGAAAGCCCTTTAGTTAATGGAGGTATACCTATAAGTCCAGGGGAAATAATTACCACCAATTCTGATGTAATTGTGGGAAAGCCTGCAGTTGGAGGTCCAGTATCTTTAGTAGGCACTGGAATGAAGTTACAAAATCAGGCACAGGCCCCAGACAATGCCGTTGCACACAGTGATATGTATAGCATTAAAGAGAAACCTCTAAATGATTACCCTATGGTCGGTACAAAAATCGATGATTCCTATGATTTAAGGCCACCAGAGTTACCCAAGCCTAATGCAATGGTAACTAAAAATGCAATACGACCACACGGAGCTCATTTTTCTCCACCTAATATTCATATTCCTTTACGGAACTCTGGAAATCTATATAAATCTAGCGAACATAGTGGTCAAATAAGTTATAGTAAGGATAAATCACCCAATTTAGTTTATCATGGACGTCCATCTATTTTAGATTATAAACCATCTTTCACAAATAGTGTGAAAAAACCTTTTGAAAATAAATCAACAAATAAAGAAAACGATCAACAAGAAATACCTGATAATAATCCTAGCTCCTCAGAAATCGTCAGTACTCATATTATGACGGATGGACAAGGGACCGATTTCGAAATTGTAGGTGCAATGAATAAACCTTTATTAGTCGATATACAGCCTTCAAAAGTTGCTAATGTACTGATACCTCATGGCAGTTCAACCGCACTAGTATTTGCGGGATCTGCAGAGCCCCATAAAACTGGTGATTATGTTGATGATCCTTTACCATATCCAGAACCTGGTTATTTTGGAAGTTTTAGCATAGATGCACCTCATATGACAAATGTACATAATGTTGCAAGCTACGGAAAAGAATGTCAACCAGATTGTAAGGCTTCGAGAAATGAAAGATGCCAGAGGATCGATAGTGTTATGAAGTGCGTGTGCAGACCAGGCTTCGCTAGAATGTTCCCAGATCGACCGTGTAAACCAACGTATACGTACTCCGTAAGATTAGGACTGGGGTCTAGAGATAACAAGGTGCTCAAATTCCATAAAAGTCTATCGGACAATTCTACTAAAGAATATGAGAGTTTGTCATTGGCTACTCATGAAGGAATAAATCGAATGATTATGCAATCAGACTTAAGGGATGTTTATCACGGAGTCCACATAACAGGATTCCATCCTATTGAGATGAGAACAAAAGACGGAGCCTATCAGGGTGTTATCAATGATTTCTACGTCCAGCTTTCCGATAACGCCCATGAAAGTAGACTGAAGGAAGTGATAGAGAAATATCTACGGAATAATAACTATAGCCTTGGTGGAACAGAAGTTTATGCATCTGAAGAATTTATTGAAAGCCTTAATGTCAGCGATTTCGACGAATGCACGAGCACTCAGTTCAATGACTGTTCCGAACACGCCCGCTGTTTCAACCTTCGCGGAACTTACACTTGCAGTTGTTTAGAAGGTTTTGCGGACCTCAGTGTCAACACTCTATACCCTGGGAGGATATGTTCTTCTGACGCAATAGGCTGTGCAGGCTGCAACTATCATGGCACGTGCTTCGATCGTGAGAACGCGATGATTTGCGAATGTTTCAAGTGGTACGCGGGACGAACTTGCCAGGTCAATTTGAAAGCTGTTTTGATAACAGTCACCGTGGTAGGGGCGCTGGTCATCATAGTGGTGACGATCTGGGCGTCGAAGAGATGCTGCAGTCAGAAGAATCCCACGAATCAAACGTTTGTTATAGGTTGCATGCAAGGAATGCCAAGTTTACATCAGGGTAACGTACCATCAAAGCAGAGAGCTGACAGACGAGCATTAATTGCTGAAAGAAATGAAACAGCAGAGACATGTAGCGTGCAAAATGCTTCACTACCTTACGCGCCGTCAAAATCTCGGTCGCGATCACACAGTAAGCAGGCGCCGGAGCCTCCTCCTCACTCCCCACCTCCGCCGCCCGCCCTAATGATACCACGTGCAAGACTACATCCACTACATGACAGTCGCGATAATTTGTCACGTAGGAAGAGTAGCGAAGTGTGTAACGAAGCTAAACTTATCAGTTACTTGGAATCGGGAGCGACAAACACTCAGGAGATGCGGAGAAAACACAGCATTGAATCATCGTACAGTGTAAATAAAGAGAGAGCTAATAAACAAGGTGCACTTGTATCAGCTGGTTATAAAGTTTCAACGACCATTCGTCCAGACGAGAACTCAATCAAATGTGAAAGGGACGACACTTCGTCCATCAACAAAAATGATTTAGAAGCCGAGCTGTCACGCTTCGACACACTTCGCAAGTCTTATAGTCAAGAAGATATGTCAGAATGGACGGATGCTGAACGTCGTATTGGGGAGTTGACTTTATCTGAAGCTAGATCGGTCGGGGGAACTCTTCCAGCGAGCACTGGCAGAGCTGCTTCATCCACCAGACTCACGCATCAGGAAGCCAACACCATGGCGGAACGAGACTTAGGCTCCACTTTTCTCCTGCCGCACGTGCACCTCTATAAACCAGACCTTACCAGTGACGTGTCCGAGTTCGACTCCCTGTGA

Protein sequence:

>DPOGS208594-PA
MDGSIPHSTTASGAFNTNAIVYDNSPWRPGRDVEPERGEDERKLKNLATRIMSNGVEVLVKDRSAEEKQEQTRYSDVKTIQPSAVSNSFVNNNPKIDKRIDDDHNHFDLIEPSEMSHTSCSTSCTCSHIQTESLKTKYTNVKSKSTKYSGTKIDVPPTHLSSYDHTKYHPYEYETSSHGPPTDKVVEKYTQVTDYSDENDSNSLDMSDLLTNEQAPEPPKTKSTTSKPPEKSKPDKPQKVNKNKFVVAELIKLGSLGIKGLSQLAPVIEKMTGGFMRRQETNRTTSTTTTVKPINKVVGYSANKRVDNEVESKHNNFPIYIPVDEMEMAESQIGYTNVTLQQNIAWAADHKNSKVNLMKSKIVHESPLVNGGIPISPGEIITTNSDVIVGKPAVGGPVSLVGTGMKLQNQAQAPDNAVAHSDMYSIKEKPLNDYPMVGTKIDDSYDLRPPELPKPNAMVTKNAIRPHGAHFSPPNIHIPLRNSGNLYKSSEHSGQISYSKDKSPNLVYHGRPSILDYKPSFTNSVKKPFENKSTNKENDQQEIPDNNPSSSEIVSTHIMTDGQGTDFEIVGAMNKPLLVDIQPSKVANVLIPHGSSTALVFAGSAEPHKTGDYVDDPLPYPEPGYFGSFSIDAPHMTNVHNVASYGKECQPDCKASRNERCQRIDSVMKCVCRPGFARMFPDRPCKPTYTYSVRLGLGSRDNKVLKFHKSLSDNSTKEYESLSLATHEGINRMIMQSDLRDVYHGVHITGFHPIEMRTKDGAYQGVINDFYVQLSDNAHESRLKEVIEKYLRNNNYSLGGTEVYASEEFIESLNVSDFDECTSTQFNDCSEHARCFNLRGTYTCSCLEGFADLSVNTLYPGRICSSDAIGCAGCNYHGTCFDRENAMICECFKWYAGRTCQVNLKAVLITVTVVGALVIIVVTIWASKRCCSQKNPTNQTFVIGCMQGMPSLHQGNVPSKQRADRRALIAERNETAETCSVQNASLPYAPSKSRSRSHSKQAPEPPPHSPPPPPALMIPRARLHPLHDSRDNLSRRKSSEVCNEAKLISYLESGATNTQEMRRKHSIESSYSVNKERANKQGALVSAGYKVSTTIRPDENSIKCERDDTSSINKNDLEAELSRFDTLRKSYSQEDMSEWTDAERRIGELTLSEARSVGGTLPASTGRAASSTRLTHQEANTMAERDLGSTFLLPHVHLYKPDLTSDVSEFDSL-