Monarch geneset OGS2.0

DPOGS215484
TranscriptDPOGS215484-TA3750 bp
ProteinDPOGS215484-PA1249 aa
Genomic positionDPSCF300098 + 424702-436254
RNAseq coverage144x (Rank: top 54%)
Annotation
HeliconiusHMEL0034280.082.44% 
BombyxBGIBMGA007322-TA0.075.81% 
DrosophilaCG17360-PA4e-3229.15% 
EBI UniRef50UniRef50_Q16L832e-5526.52%Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q16L83_AEDAE
NCBI RefSeqXP_001656248.14e-5626.52%hypothetical protein AaeL_AAEL012744 [Aedes aegypti]
NCBI nr blastpgi|1571334357e-5526.52%hypothetical protein AaeL_AAEL012744 [Aedes aegypti]
NCBI nr blastxgi|1571334354e-6526.75%hypothetical protein AaeL_AAEL012744 [Aedes aegypti]
Group
Gene OntologyGO:00055158.6e-11protein binding
KEGG pathway 
InterPro domain[1008-1107] IPR0018498.6e-11Pleckstrin homology domain
[1009-1106] IPR0119935.3e-10Pleckstrin homology-type
Orthology groupMCL25335 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215484-TA
ATGCCTTTCGACAAAGCTTCTGAAATTAAATGCAAGGAGATCGATAAAATCAAACGTCGTCATAAAACTGACGACTCTGCTGTTGCACCAACTCTAAGATGTGGTGAAGATACTTTACAGTGCTCAAAAATGGAGAATGTGGACGAAAATACTGGAAAACCAACCGATTTGTGTTTACAAGGCAATACCCACCTCATATCTAGTACAACGAGCAGTCTTGCCGATACGCCCGGCGATTCTGGAGTTTTGTGTTTGGATAGCGAAGCATCAGAGGCTACGAGTCAAGCTCTTATGTCTCATAGCCTTGTAGGGGCCGAAGAATTAACGTGCGATAGTATCTACGATGCCATTCCTAGTGGTTCCCAAGACATGATAAAGAGTACCAATAGCAGTATCATGACTCGCAGTGACATAGATAACCAAAATGTAGGAGCACCTGATGATATTGTACCTGAACTTTTGATAGAAGTTCATTCAAAGCCTGTGTATGAGAATCTACCAGATGTTGTTCTAAAAGCTTTGGAGAGTGAGAAAGTTTGTACTGATGATGTTAAAAAGTCTGCACTAAGGTCAACTGAACCAAAGAAATCTTTATCAGAGGACATTGTATATAGGCGGAGATGTAGAAAAAAGTCGCAAAGTGGTCCCAGCTCACAGAAAAAAAGGGTGTCATTTCATGAGGACATTTTAAATAATACCAGAACTGATAACATTCATATAGAAAGAGGTTTCATTTCTTATGGGCCAAATGGTTCATATTGTGATAAATTCAGGCAAAAGAATGCTGTAAATGACAGATTCTCATGGTGTTCCTCTGGAGATAGGCAACAACCCAAATATGCTGCAGATGTAGCTCAACAGACAATGTCAGATATACTGACATATGGGGATTCAAAGCATGACAGAAGTGGTATATTTGAATATACTCAAGGCTTTGAAAGAAATAATCAAGATAATAAGGATTCTGATAATAATAATGAAACAAAGGCCACAAACAAAATGTATGGCTGTGACTCAAATAGCTCAGATTCAAACTTCAGCTGCGATTCAGAAACCTCATCTAGTGACTCTTCCTCCTCACACTCACATAAAACAGAAACATTGCAAGTCCACAAAAAAACCGACAAAACCCAAAAGTCGTTCTCATGTGATGGCTTCGAGAATGGTGGAGGGCATGTGTCGTTCATAAGAAAGACATATTTTTCGGAAGCGGATATAGATCAGAACAACGATCGAAACAACAAGCCTTTAGAAGTCACACAGAGCCCGATCACAGTGAAATCGGTCCTCAAGAAAAAAAGATACATCAGCACAAATATCGTAGAGGAGAGAAAGAATAATAACAAAGTCTTGAATTTACTTGATGCCAATAACATAATTGATTCATTAAAGAATTTTTATAAAAATTTTAATTTCAACTTTGCACCCGAAAAGGGCTTGCCAGAAACTAGTTTGGAACTTAACAACGTTGTTGAGGCCCTATCATGTGATGTCAATCAGAATACTAAGAAAATATCAAAGAGTCTAGACTCTGGATTCCAAATCGACGATGAGGATGACTTTGTTGAAATCAATCTTAACTCTCAACCATTAGAATTGGATAAACCTAAAGAAGTAATGACTGGTTCCAGGACCTCTGTTACAGGCGATCGAACCCCAAGCGAGGGCTCGCCTCGACATAAACTGACCCTTAATATGAAAGCGCAACACGATAAAAAACTATTAAACAAAGACGTGCTGCCACCGCTCAGCAAGTACGTGGTGAACTGCGAGAGTACTGTGTATGAGCACAAAGGAGTTTCTTACAGCTACGTGCATGATACATTTCAAACAGCATTCGAAGCACCCAAAGAATCTTTCGTACCGATACCAGAATCTACACCCATTAAGGAGATCGTGTTGAGTTCAAGCAAGACGGATTTGAACAAACAGTCGGACGATAATAATTCAAGTAGGACATCTACACCAAAGCGCCAGTTCAACAAGAACGTCCAGGAGGAAACTGAACAGAAGAACGGCATGTCCAAGCACCTGTCATCACCAAAAAAGCAGAATGTCAACAGATACAGTCGCAAATCCGCTTCAACGGCTGCACAGAAGAATGATGACGTCAGCGACAACTGCTCCAACACAGACAACTCAACCCTAAAAGGCGCCGACGACTTCAACGACGACTTCTTCGACAGTCCATCGAATCAGAAATTGCAAAGCAACAAATCAATCATACTAAACAGATATCTCAAAAACGTATGTCAGAAGAAAGATCTCGAACTGAAGATAAAAAACAACAAGTTCTATCAGCTCAAGTTGAGAATGGAGAGGCTCTTCCCCAGCATCATATATCCGTTTAAAGAGATGTCCGACACTTATCACATTTCCGCAGCAAGGATGTCGCGGCTCATCGACAAGGAAATTATAGAGAACAAACGTTTGTCAGTCAGGTTACTGACGGCCGATGAACCGTCATACTTTGACAGCTTCGAAGACAATGTGGCCATCGAATGCAATGAAAAGCTCAGGCTTCAGATATTCTCATATCCTAATGAAACGCTAAATAGGATGATGAAAGTACAGTCTCCATACAACATGGAGGACGGGTCTTGGACGCCACTCCTAGTTTTCATCACGGACTACGCGCTTTACGTGGCCAGTGTGAAGCCGGGGGGTACGGAGTATGACATCCTTTGCAGACTACCCCACGATGAACTAGACGCAATCGCTGTCGGTCCTGAAGCTCATTACATACAAGTTATCGATGTGGCTGGTAACATAGCGTGTTCTGTGGTCACTGGCGAGTCGTCCTTAGGGGCGAGGCTGGCGTCAAGTCTGGAATGGAGCGCTAGGACTTCGCCCCTCCGTATACAACGCCCTCCGGCTATGCTGCCGCTGTCTTGCAAGCATCTAGCCGCCGCTGTCACCAGGAAGAGGCATGAAACAAGGGTGCCGCCGATTTTGTACTACGGTCGAGCGTGCTCCGGGGACGCCGAGGATCGTTTGGTGGAGGCGCCTGGAGCCTTCGCCCCCCCGCTGGAGGGGTACCTCATGTGCAAGACTGATAGAAGCAAGAGATTTGAACCGTGCTACTTCCTACTAAGGGCTGGTGTCCTCCACTGGGGTCCTCACTCGCCGGACGGGACCCCACTACCCAATAACATATCTCTCCGTTCCGTGGTAGGGGTCAGGCGTCATGTGGACGTGTCCCGCAGACCCCACTGCTTCGAGATAGCATTAGGGGACAATAGCAGACTGTTACTGGCGGCACCCGATGACCCCACAGCGTCATCATGGCTGCAATCACTATTACTACACGCTGCTCAGGCTCTCGAAGAAAAGGATGGATGTAAGAAGTGTGAACGTGTGGGTCCACCACCAGCCTGTACTATACTGGTGACCCCTAACGAGGTGGTCAGTTTGAGAGACACCTTGGATCTAGCCTTCGTAGCAGGAGCAGCCGCCTATCTTAAAGACAGGGGCTCGGAAGCCCTGTTCAAAGGTAAATATTACAAGCAAATAAAAATTAAACACCATGAGTGGAGTGTTTGTGAAGCTCGAGAGCGCGGCGCTGGACTCGCTCTGTCACTGCCGTCCGCCGCGGAGCTCGAGAGACTACTAGCGGCAATCGAGGACGCGCACTCAATGTTAACGCATGAACCGTTCCCATTGAGTGTTGTGGAGGCGTCAAAATCCCCCTGCAGTTTAGCCTACCACAAGTACGAGAAGGCCTGGTCACACATGTTACCCCAACAGTGA

Protein sequence:

>DPOGS215484-PA
MPFDKASEIKCKEIDKIKRRHKTDDSAVAPTLRCGEDTLQCSKMENVDENTGKPTDLCLQGNTHLISSTTSSLADTPGDSGVLCLDSEASEATSQALMSHSLVGAEELTCDSIYDAIPSGSQDMIKSTNSSIMTRSDIDNQNVGAPDDIVPELLIEVHSKPVYENLPDVVLKALESEKVCTDDVKKSALRSTEPKKSLSEDIVYRRRCRKKSQSGPSSQKKRVSFHEDILNNTRTDNIHIERGFISYGPNGSYCDKFRQKNAVNDRFSWCSSGDRQQPKYAADVAQQTMSDILTYGDSKHDRSGIFEYTQGFERNNQDNKDSDNNNETKATNKMYGCDSNSSDSNFSCDSETSSSDSSSSHSHKTETLQVHKKTDKTQKSFSCDGFENGGGHVSFIRKTYFSEADIDQNNDRNNKPLEVTQSPITVKSVLKKKRYISTNIVEERKNNNKVLNLLDANNIIDSLKNFYKNFNFNFAPEKGLPETSLELNNVVEALSCDVNQNTKKISKSLDSGFQIDDEDDFVEINLNSQPLELDKPKEVMTGSRTSVTGDRTPSEGSPRHKLTLNMKAQHDKKLLNKDVLPPLSKYVVNCESTVYEHKGVSYSYVHDTFQTAFEAPKESFVPIPESTPIKEIVLSSSKTDLNKQSDDNNSSRTSTPKRQFNKNVQEETEQKNGMSKHLSSPKKQNVNRYSRKSASTAAQKNDDVSDNCSNTDNSTLKGADDFNDDFFDSPSNQKLQSNKSIILNRYLKNVCQKKDLELKIKNNKFYQLKLRMERLFPSIIYPFKEMSDTYHISAARMSRLIDKEIIENKRLSVRLLTADEPSYFDSFEDNVAIECNEKLRLQIFSYPNETLNRMMKVQSPYNMEDGSWTPLLVFITDYALYVASVKPGGTEYDILCRLPHDELDAIAVGPEAHYIQVIDVAGNIACSVVTGESSLGARLASSLEWSARTSPLRIQRPPAMLPLSCKHLAAAVTRKRHETRVPPILYYGRACSGDAEDRLVEAPGAFAPPLEGYLMCKTDRSKRFEPCYFLLRAGVLHWGPHSPDGTPLPNNISLRSVVGVRRHVDVSRRPHCFEIALGDNSRLLLAAPDDPTASSWLQSLLLHAAQALEEKDGCKKCERVGPPPACTILVTPNEVVSLRDTLDLAFVAGAAAYLKDRGSEALFKGKYYKQIKIKHHEWSVCEARERGAGLALSLPSAAELERLLAAIEDAHSMLTHEPFPLSVVEASKSPCSLAYHKYEKAWSHMLPQQ-