Monarch geneset OGS2.0

DPOGS209591
Transcript        DPOGS209591-TA  3222 bp
Protein           DPOGS209591-PA  1073 aa
Genomic position  DPSCF300015 - 643292-657293
RNAseq coverage   656x (Rank: top 20%)
Annotation
Heliconius      HMEL017014        0.0     81.43%
Bombyx          BGIBMGA006651-TA  0.0     78.72%
Drosophila      osp-PF            3e-80   34.92%
EBI UniRef50    UniRef50_E2A2N5   2e-162  37.95%  Protein outspread n=5 Tax=Formicidae RepID=E2A2N5_CAMFO
NCBI RefSeq     XP_001944135.1    1e-143  37.77%  PREDICTED: similar to outspread [Acyrthosiphon pisum]
NCBI nr blastp  gi|336088618      1e-167  38.48%  protein outspread [Apis mellifera]
NCBI nr blastx  gi|383848471      2e-175  38.79%  PREDICTED: uncharacterized protein LOC100878505 [Megachile rotundata]
Group
Gene Ontology    GO:0005515           5.1e-15  protein binding
KEGG pathway
InterPro domain  [268-364] IPR011993  5.1e-15  Pleckstrin homology-type
                 [266-366] IPR001849  4.3e-13  Pleckstrin homology domain
Orthology group  MCL15834  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209591-TA
ATGACGACTGTCCTGGAGGTGAGCGAGGCGGACAGCGTGACTGGACACCCCCACAGTATCGCTATCACCGCGCCAGAGAGGGTCACCTTCGTGAAGGGCACATCCAGGGAGGAAGCCAGGTGGTGGACGGACGTATTGAGCGTATATCCGAGGAGCAAGGGCCGACACAAACGCAACGCCACGTTTCCCGGAGGTCAGACCGCGAGTTTACTGCAATCTTCGACAACGAGAAAATACTCCGCAGACGCGTCGACGTTGCGCGACGCAGCAGACTGCCGTCCGAGGTTCTGCGGCAGCACGACCACGTGGCCACGAGTCCAGCCGCCGCAGCCGGAGATACCCAGCCTCGCTGCCCCCACCAACATCGACACTAAAGTGTATACGGATCAGCCGGTCTCGTCCGCGTCACCGCCCACAAGGGATAAGATCAACGGCGAGGAGAAGGCTCGTTCCCGACGACGGGACACCTGGCCCGAACCCACCACCTCACCTTCCAACGACGAAGGTTCTGTCTCGTCTCCACTCCTGCAGCATCACGCTCAGTACGACGAACAGTTGCGTGATATAGCAGCGTCCCTGACCCGCCCGCGCTCACGACGAGCCCTACCCGCTCTGGAGAGACCCACACGACTGCCACCACCTGATAGATTACTGGCTCGAGGTTCACCTGATGGAGGTCCGCCTGCCGAAGAAGGTATAACGAGCTCTGCCAGTGAAGGCAGTGAACCGACCGACACAGTGGAAGCTACGGAAGGAGAAGGAGGCCGAGTGGAACTGCCAGCTGAAAGATTGCTGCATGCGCGAGCTGGGTGGCTCCAGAGACGAGGAACCGGGGGCTGGTCGCGACACTGGTTCGTACTCCGAGGAGCGGCTCTGTTGTACTTCAGGGACCCGCATGCTGAACACCGAGGTCTGATGGATGGAGTTATTGATTTGAGTGGAATATCAAGAGTGGTGGAACTGCCGTCTTCTACAACTACGAATGGATTTGCTTTTGAAACTGAGACATGGGATGGCAAACACATAGTCTTATCAGCCGTTACAGCTGGTATCAGAGCCAATTGGGTCTCGGCTATGCGAAGGACGGCCGGTCTACCGGATACTGGATCACTGTCTCTTATTTTAAGAGAGGATTCTGTAGATCAAGCTTCTGAGTCGTCAACTTCTCCTGTTACACCCATCACACCGAACACTGCGAAATCGGGACCATTTTCTTCGGATGAAGAATACAGGACGGCATCTGAAGGTGGTCGAAGGGATAGCGCAGATTGGGGAGACTTGACCCTACAACCGCCACCATCACCAATTCTCAACCGCACGCCTATCTCTAAAGTAAAAGAAAAAGTACGAGCACGTGGATGTCAACAAACTGCACCCAAACCAGAAACGAAACTTGATAAAGATGAAATCGACGCTACCAAAGAAAAAGATCAGAAGATAATAACCGAAGTTGACGAAAACGATAAACCGAATGAACCGAGGAAACGGAGTTACGTCTCGACGATAGATAAACAGACGATTGAAATAGAGGATTTACGAAATCAACTCAAACAAGCGCTGAACGATGTTAACACGGCCGAGTCAGAACTGGCGCGACTACGAAAACTCAAAGCTGAAGCCGCTTTAAAGGAAAAGAAAATGGAGGAGCTCGTGATAACACTGCAACAGAAAGAAGAAGAACTGGCTGTTAGGACGAAGGAAGCCGAGAGTTTGCATACAATAAAGCAACTATACAACGAACATAACAATATGTGGGAAAAGAAGCTGACGGAAACACAAAACTTTTTAAAAGAATCCAACGATCACTGCGAAAACTTGACCCAGCAGTTATCGACAGCTCATGACACCATAAAGCAACTGCAGAGAGAATTAAATGAGCTTAACGATAAGCTGATGAGAAGCGTCCAAGATAATGATAAGTTATATTCAAGAATACGAGAACTCGAGCACAAGGTTATAAACGAGTCACCGACCAAGGAAAAGAGGAAAAGCATTGGGTC
ACTGAGTGATCTCAGCAACATCAATAAAGACTTGAATCTCGAGTCTTTAGAGAAAAATAGACTGATACAAGAGTATGTAGACTTAAGGGACAGGTTTTTAAAGGCCATAGAGGAAATTAAGGCCATGAAGAAAGAGCTTCGCGAATCCCATAACATGTATGATGAACTGGAAATTACTAATATGAAGCTGAAAAACGAAATGAAACTGAGAGAACAATGCATCAGGTCAGAAATGGATTTAATGGCTACACGTATTGTAGATTTAACGCAGAAGCTGACAGCTTCAGACAAACAAGTGCGAACGTTGAAACATAAAATTCAGAAAACGGAATCCAGAGAAAAGAGGCGCAGTCTGTCATTAAAAGGCAGAGAATCTTTTACGTTAGGAAAAGAATTGGAAGAAAAGTTGACAGAACTTGAAAATAAGATTGCGTCTTTAGAAAACGGCGAATCCGTGCCTACAATAAACTCCCCTTCTAAAAGCGCTTCGCCTGCCAAGGAAAAAGTCTCTAAAACAGATAGTACCAACGACGAGAAACGAATGAAAAGATTGGCTGCTAGACTGAGACGGAAATCATTAGATAGTGCTACAAGTTCGGAGCCCATGAAAATGCTAGTCCGCTTAAGTTCGCTTGAAACCAAAGTTGCAACTGCTATAGAAAATCGCCGAGAATTAACAAATTCCTGTGAATCTTTAAGCCCCATGGCAAGATCTCCGGAAGGTACGACTAATAGCGAGAGCTTAGAAAGCTGTACAATTGGGACACAATCACAAAGGCATCTTTTAGATAGGCTTCAAACTCTTGAAAATATTATCATTCATTCACGAAGCAAAATTAATGACTGTCTTTGTCAAATGAGCGCAATGCGAGCGGCTAAGTCCAGACGATCTCCATCGCCGAGCGTCGAGAAGAAATACAGTATAAAATCTATGGAAAAGTGTTTAATGGATGTCAGCAAACGACTACAGGAGTGCTTTGATAAGTGTGTAGTAGATGCCTCAGAACAACGAACTGAGGACATTAACGACAGTGTTGCTCAGGTAGTGGTCCAGCTAGAAGAACAACTCAGGTCCAAACTTCTAGAAATATCTAAAAAGAAAGCAGCCCTTTATGAAGCTGGAGAATTAACGCAAAGAAAGAGTTTAGAAATTTTGGCAGAAAAGTTAGCATACGAGGCGGTTCTGATCGGACGAATACAGGAGGCCCCTCGAATCGTCTAA

Protein sequence:

>DPOGS209591-PA
MTTVLEVSEADSVTGHPHSIAITAPERVTFVKGTSREEARWWTDVLSVYPRSKGRHKRNATFPGGQTASLLQSSTTRKYSADASTLRDAADCRPRFCGSTTTWPRVQPPQPEIPSLAAPTNIDTKVYTDQPVSSASPPTRDKINGEEKARSRRRDTWPEPTTSPSNDEGSVSSPLLQHHAQYDEQLRDIAASLTRPRSRRALPALERPTRLPPPDRLLARGSPDGGPPAEEGITSSASEGSEPTDTVEATEGEGGRVELPAERLLHARAGWLQRRGTGGWSRHWFVLRGAALLYFRDPHAEHRGLMDGVIDLSGISRVVELPSSTTTNGFAFETETWDGKHIVLSAVTAGIRANWVSAMRRTAGLPDTGSLSLILREDSVDQASESSTSPVTPITPNTAKSGPFSSDEEYRTASEGGRRDSADWGDLTLQPPPSPILNRTPISKVKEKVRARGCQQTAPKPETKLDKDEIDATKEKDQKIITEVDENDKPNEPRKRSYVSTIDKQTIEIEDLRNQLKQALNDVNTAESELARLRKLKAEAALKEKKMEELVITLQQKEEELAVRTKEAESLHTIKQLYNEHNNMWEKKLTETQNFLKESNDHCENLTQQLSTAHDTIKQLQRELNELNDKLMRSVQDNDKLYSRIRELEHKVINESPTKEKRKSIGSLSDLSNINKDLNLESLEKNRLIQEYVDLRDRFLKAIEEIKAMKKELRESHNMYDELEITNMKLKNEMKLREQCIRSEMDLMATRIVDLTQKLTASDKQVRTLKHKIQKTESREKRRSLSLKGRESFTLGKELEEKLTELENKIASLENGESVPTINSPSKSASPAKEKVSKTDSTNDEKRMKRLAARLRRKSLDSATSSEPMKMLVRLSSLETKVATAIENRRELTNSCESLSPMARSPEGTTNSESLESCTIGTQSQRHLLDRLQTLENIIIHSRSKINDCLCQMSAMRAAKSRRSPSPSVEKKYSIKSMEKCLMDVSKRLQECFDKCVVDASEQRTEDINDSVAQVVVQLEEQLRSKLLEISKKKAALYEAGELTQRKSLEILAEKLAYEAVLIGRIQEAPRIV-
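
The two sequences above can be cross-checked against each other. The transcript is listed as 3222 bp, which is 1074 codons, or 1073 residues plus one stop codon, matching the 1073 aa protein. The following is a minimal sketch of that check in Python, assuming the transcript is a plain coding sequence read in frame from its first base and translated with the standard genetic code. The names `translate` and `stop_symbol` are illustrative and not part of the database; the stop is written as "-" only to match the protein record above.

```python
from itertools import product

# Standard genetic code, listed in the conventional TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence (hypothetical helper).

    Whitespace and line breaks are removed first, so a sequence that is
    wrapped over several lines can be passed in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten codons and last seven codons of DPOGS209591-TA, copied from above.
first_codons = "ATGACGACTGTCCTGGAGGTGAGCGAGGCG"
last_codons = "GAGGCCCCTCGAATCGTCTAA"

print(translate(first_codons))  # start of the protein record
print(translate(last_codons))   # end of the protein record, including the stop
print(3222 // 3 - 1)            # residues expected from a 3222 bp transcript
```

Passing the full DPOGS209591-TA sequence to `translate` and comparing the result with DPOGS209591-PA extends the same check to the whole record.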