Monarch geneset OGS2.0

DPOGS203986
TranscriptDPOGS203986-TA3426 bp
ProteinDPOGS203986-PA1141 aa
Genomic positionDPSCF300005 + 1219794-1235355
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0110344e-9134.52% 
BombyxBGIBMGA012258-TA3e-6431.63% 
Drosophilamfr-PF9e-6631.68% 
EBI UniRef50UniRef50_E2BLP35e-8935.33%Otoferlin n=9 Tax=Neoptera RepID=E2BLP3_HARSA
NCBI RefSeqXP_968595.15e-9434.33%PREDICTED: similar to otoferlin [Tribolium castaneum]
NCBI nr blastpgi|910799031e-9234.33%PREDICTED: similar to otoferlin [Tribolium castaneum]
NCBI nr blastxgi|910799031e-9034.33%PREDICTED: similar to otoferlin [Tribolium castaneum]
Group
Gene OntologyGO:00055151.3e-22protein binding
GO:00160215e-13integral to membrane
KEGG pathway 
InterPro domain[739-965] IPR0089731.3e-22C2 calcium/lipid-binding domain, CaLB
[331-407] IPR0125615e-13Ferlin B-domain
[87-150] IPR0129685.9e-08FerIin domain
[747-826] IPR0000081.3e-07C2 calcium-dependent membrane targeting
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203986-TA
ATATCAGTGAATGTACTGGAAGGAAGACGGTTAGCATGGACGAATCCGCATGCGGCAAATTCCTTTGTTCTCATTGTTCTTGGCAAGAAGAAACATAGAACGAGCGTAAGAAGAAATATGGAGGAACCATGTTATAGAGAAAGCTTTGTTTATGATTTATATACAACTGTAAATGACCTGAAGCAGACAGCATTGTGGCTAGCGATAATGGAACCGAGATTCTGTGCACCGCCACGATTGTTAGGAGAAGCCAGTATCGACTTAGGAGAAATTTGTATCGATGATCTAGATCATCAATCCTTTCATAAATGGGCGCAATTACTTCATCCCCGTGATTGGGCTGCTCAGCCGGTCGGTTTTCTTCAAGTTGACGTGTCAATTATTTCCAAAGCAGAGGAGCAAAACCTAATGCCAATAATTGGAAAAGATAAATTAGAAGACAAACTGCTTCTCCACTCTGACGCTCAACAGCAGTGCGCCAATTATGTTATCACAGTGCATGCAGCTTTAGGTCTACCTAATAGCACCCACGATGAAATAGGCAAGCGTATTGGGAACCCTCCCAATACTTTCGTAAAAGTTTCCTTTTGTGGCCTTGTGGCCAAAACTGGAATAATTCGTCGTAATAATAATCCGAAATATAGCGAACAAATATCTATAGTAGAAATGTTTCCTAACATGTGCCAAGCCATACGTTTCGAAGTTTACTCTGTCGAAAGGTGCTTTCATCGAGTTATATCCTGCACCCAATTGAAACTCGGCCAAGTATCACATGACGGAGAGAATGGGTTTGTACCAACGTTTGGACCATCCATGATCCAAATGTATGGTACAACGTGTGGTGACTCGTCGACTGTGATGTGTCAGGAGAGTCCATATCATCGTGGAGCTTTGGTTGTTACTCTTAAAATTATAGTGCCGTATAGTCAGCGAGGAATCAGAACTATAGCAGTCGAACCAGTGCCACGTATCAATTTAAGTTCAGAAGGCTGGCCAGATATCGTGGTTTGGCTTCTTAATAATGGCTCACGAGTGGCCTTCGCAAAAATATCCGCTGCTGATATTGTCCACTCCGTCATATCAGAACAAAAGGGAGAGTATTGTGGCCGAATACAAACCCTTTGCTTAAAGCCGTTGAAATGTCCAAAGCATATGAATTCCCCGTTATTGAGTTGTTACTGCGTTGCGGGAAAAGTAGAACTTCTAATGTGGATGGGGCTTCATCGCCAATCGTCTGATTTTGAATTCTCTATACCTCGTGGATATAAACTAAGAATGAAGAAATACAATATGTTCATTAAATCTAATATTATGTCGCCTCCTATGAATATTTTGCATGTCATCGTAAACTTTTCTATCGGTAGTGATGATTCAGCCCTTAAGGCTAACTCGTCGGACTTCCCGATTGAAGTCAGCACTGTCGCCGGTTTGGAACCAGGTATAATTGAGTTTTCCCTGGGTAAAACGGAATTGTTAGGAAGGATAAAAATTTATCCCCAAGTAACTGATGAACCGAGCTATGATAATGCACCCAGCCTCCAGTGGTATGATTTCTATCGAGGTACCGAATTCGGCGGACAAGTTCTGATGTCAGTACAATTGTTACAGTTAAATCTACCGCCATCTACAGATAGAAAGATACCTGAGAGATTACTTAGGTCTACGGAATACACTTCAGAGGAAGAAATTTTCGAAGCGAACACTGCCGACGACACCGAGATTTTTGAAGGAATTGAAACTCTTCCAATAAATCTTCTTCCAAAATCTTCTTCTTATAAAGTCGACGTTTATTGGTGGGGACTTCGGGATATCGATTCGATGCGAAAACCCTGTATTGTAATGGAAATTGAAGATCTCAGTATAAAATCTGAAATAATTACAGAGAAAGCTCACAACTGTAATTTTTCTAAAGGTCGAACCACGCAGGATTACATTGACGAAGATTCTACTACATTGAACACATTTCATAAACTGATCATATACGAGACAGAAGTTGAGACTCAGCCAGAATTTTCCAAGTTTAAAGATTGGTGCGCTACACTGAAACTTTATAATGGAAAGAAAACAGGTATTCCAGATAAGGACGAGAAGCTTTACTGCGGTTTCTTAAAGGCGGGATTTGCGATATATAAGTGGCCCCCGCCAACTAACACGATCGCTGTTACTCCCAGCGGAATTGACTTAAATAAAGGATTTTTTAACGATCATCCCCACAATAATCCTGCTGAATTTCTCGTCCGTGTTTACATTGTGAAAGGACTTAATCTCAAATCTAAAGAGTTCACTGGGCAATCTAATCCTTACGTTGTGTTAAATTGTGGAAACAAACATATAGCTGATCGAAACAACTACGTCCCGAACTCAGTTAATCCAATATTTGGAAAAATGCACGAAGTTCATTGTTGTCTGCCTGATGACTATCTTTTGGCAGTTTCGTTATATGACTACGGAATAAACTCGCCTGATAAATTAATTGGTATAACAACAATTGACCTAGAAGATCGCATATATTCTAAACACAGAGCTCGAGTTGGTTTACCTTTAGATTACAGCCTAAACGAACCTTTCAAATGGCGTGATTGTTTAAAACCTTCAGACATTCTCGAAGAAATTTGTTCAAAGAACCATCTTCCGCCTCCAAGATTCATAAACAGTAATACCTTGTTAGTCAATGGTGTAGAATATAGAAACAATGAGAAAGACGCTACTTTTTCATCAGCCGCTCTACAAAGAGAAAAGCTTTGTTTGAGTATTCTTCATAAATGGCATACGTTACCAATTTGTGGGTATCACCTTGTTCCCGAACACGTTGAAACGAGGACATTATATGACCCAAATAAACCGGGAATCGAACAGGGTAAAGTTATATTATGGGTAGACATTTTTCCTTTGGAAACGGGTGTTTATATCCCACCCCCCATTAAAATTACGCCTAGAGAGGCTGAAGATTACGAACTCAGACTAACTGTTTACAATGTTCGAATCAAAATGAGCGACTTAGATAACTTAGGAAGACAAGTCTCTGACATTTACCTTGTTAACAAAGAGAGAGGTATCTTTACAGAAAGTGGCGACAACGTACCGCCTGTACTTGTTGTTCAGGTCTTAGACAACGATGATTTAAATCAAACCGAGGATTTAGGGAAACTTATGCTAAATCTGAACTCTTTGACTTGTGGAGAGAAACAAGCTCAGGACTGTTCCTTAGAGTCCTTGAATAACAATAAAAAAATTGATTTGTTCTATAGCGAATCTATAAAATCCTGGTGGCCTTTAGCTACAGTTGATGAGAGTTCTGGAGCATTAATTTTTAGGGGTTGCATATACTTAGAACTGACATTGATGCCATTGGAGAAAGCTGTTGTAATGCCGGTAGGCGTCGGAAGAGAACCTCCATTTCCTTTGTTGGCACCTGCGTTAGTATAG

Protein sequence:

>DPOGS203986-PA
ISVNVLEGRRLAWTNPHAANSFVLIVLGKKKHRTSVRRNMEEPCYRESFVYDLYTTVNDLKQTALWLAIMEPRFCAPPRLLGEASIDLGEICIDDLDHQSFHKWAQLLHPRDWAAQPVGFLQVDVSIISKAEEQNLMPIIGKDKLEDKLLLHSDAQQQCANYVITVHAALGLPNSTHDEIGKRIGNPPNTFVKVSFCGLVAKTGIIRRNNNPKYSEQISIVEMFPNMCQAIRFEVYSVERCFHRVISCTQLKLGQVSHDGENGFVPTFGPSMIQMYGTTCGDSSTVMCQESPYHRGALVVTLKIIVPYSQRGIRTIAVEPVPRINLSSEGWPDIVVWLLNNGSRVAFAKISAADIVHSVISEQKGEYCGRIQTLCLKPLKCPKHMNSPLLSCYCVAGKVELLMWMGLHRQSSDFEFSIPRGYKLRMKKYNMFIKSNIMSPPMNILHVIVNFSIGSDDSALKANSSDFPIEVSTVAGLEPGIIEFSLGKTELLGRIKIYPQVTDEPSYDNAPSLQWYDFYRGTEFGGQVLMSVQLLQLNLPPSTDRKIPERLLRSTEYTSEEEIFEANTADDTEIFEGIETLPINLLPKSSSYKVDVYWWGLRDIDSMRKPCIVMEIEDLSIKSEIITEKAHNCNFSKGRTTQDYIDEDSTTLNTFHKLIIYETEVETQPEFSKFKDWCATLKLYNGKKTGIPDKDEKLYCGFLKAGFAIYKWPPPTNTIAVTPSGIDLNKGFFNDHPHNNPAEFLVRVYIVKGLNLKSKEFTGQSNPYVVLNCGNKHIADRNNYVPNSVNPIFGKMHEVHCCLPDDYLLAVSLYDYGINSPDKLIGITTIDLEDRIYSKHRARVGLPLDYSLNEPFKWRDCLKPSDILEEICSKNHLPPPRFINSNTLLVNGVEYRNNEKDATFSSAALQREKLCLSILHKWHTLPICGYHLVPEHVETRTLYDPNKPGIEQGKVILWVDIFPLETGVYIPPPIKITPREAEDYELRLTVYNVRIKMSDLDNLGRQVSDIYLVNKERGIFTESGDNVPPVLVVQVLDNDDLNQTEDLGKLMLNLNSLTCGEKQAQDCSLESLNNNKKIDLFYSESIKSWWPLATVDESSGALIFRGCIYLELTLMPLEKAVVMPVGVGREPPFPLLAPALV-