Monarch geneset OGS2.0

DPOGS203020
Transcript: DPOGS203020-TA  1221 bp
Protein: DPOGS203020-PA  406 aa
Genomic position: DPSCF300068 + 364745-372211
RNAseq coverage: 6x (Rank: top 87%)
Annotation
Heliconius: HMEL011034  4e-19  41.22%
Bombyx: BGIBMGA012257-TA  2e-23  86.54%
Drosophila: (none listed)
EBI UniRef50: UniRef50_E2AYV9  2e-38  42.02%  Otoferlin n=3 Tax=Coelomata RepID=E2AYV9_CAMFO
NCBI RefSeq: XP_002430476.1  4e-40  44.09%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|350416842  7e-40  42.02%  PREDICTED: otoferlin-like [Bombus impatiens]
NCBI nr blastx: gi|350416842  4e-39  42.02%  PREDICTED: otoferlin-like [Bombus impatiens]
Group
Gene Ontology: GO:0005515  2.1e-11  protein binding
KEGG pathway: (none listed)
InterPro domain: [325-383]  IPR008973  2.1e-11  C2 calcium/lipid-binding domain, CaLB
                 [328-381]  IPR000008  4.7e-07  C2 calcium-dependent membrane targeting
Orthology group: MCL30914  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203020-TA
ATGCGCTACTGGCAGCACACTAGTCAGATAGCTACAAAACTCGACTTACCTCCGAAATGCTTTTGCCATGAAATATTAGCTAGACAAGTTCAACAAGAGTGTTATATACACAGCAGAGCAGCGTCTGTAGCAGCAGTCTCTAGATATTATAGTTCAGCTAAGACGGAGATCTTCCTCCCATACATTACTCATGCGGTCTACATTGACATGGATGTTTTCTACATCGAGGAACATCTTTTAAAAGAGAGAGCTGCTTTAGAAGCAGAAAGAACAACGGTAGTGGAAATTCACAGCCAGCGTGATCCTGATTTGCCGAGGGTCCACGTCTCTCGTCCGCCGCACCGTATAATACATGATCACATGATGAATCCTATGCCATCCACTAGCAGGCAGCCAGATCCCATACCAGCACTACAACTGGAACAGGATCCTGAACCCGAAATCGAAACGGTTCAGATGGAAGAAGATGAAGACACTAAATACTTGGATGAACTTGAGCGCAATATCGCCGCAGCTGCGCGCTCCTTACAAACACTAGAAGTCGTCGCCGACGTCGAACCAACTGATGAATCAAGATCCCAAAAACTTAAAAATGTCGGAAAAAAGATTCTGAGCAAAGGATTGTCTTTTGGGGCTGATAAAACCGTGAGACTCGTAGAGAGGAGCACCTCGCCCCAAAACAGACCGTCAACCAGCCGGCAGCAGACCAGCAGGCGTACTTCGCCTTTGAGAACTATGAGAAATCTGATTCGATTCGGTCGAGCTCGTCGTATTGGCAGCGGTGACGAGGAACAAGGCTTGCTAGAATGTCAACCAGGACCGTCATCAGCTCCAGATCGACCAATGAGCCATTCTTCTGAAAACTTGGATGAAATACTCGTTGATACCAGTACAGACTCTGACTTTCACAATTTCCGTAAAATACGGACAACTATAGACTCGGGCCGCGCTACGGCCCTGAAAGCTACTGACTTTCAAGTCTGCGTGACGATAATCGAGGCTCGCCAACTGGCCGGCCTCAACATGGACCCCGTGGTGTGCATTCAGGTGGGAGAGATTCGCAAGTACACCAGCGTCAAGGGCAGCACGAACTGCCCCTTCTACAATGAGTACTTCGTGTTCGACTTCCACATGCCGCCGGTGATGCTCTTCGACAAAATAATTACACTATCGCGAGAGGATGTTTTACCAGATATAAGTGACGAAGATCCTCCGAGATGA

Protein sequence:

>DPOGS203020-PA
MRYWQHTSQIATKLDLPPKCFCHEILARQVQQECYIHSRAASVAAVSRYYSSAKTEIFLPYITHAVYIDMDVFYIEEHLLKERAALEAERTTVVEIHSQRDPDLPRVHVSRPPHRIIHDHMMNPMPSTSRQPDPIPALQLEQDPEPEIETVQMEEDEDTKYLDELERNIAAAARSLQTLEVVADVEPTDESRSQKLKNVGKKILSKGLSFGADKTVRLVERSTSPQNRPSTSRQQTSRRTSPLRTMRNLIRFGRARRIGSGDEEQGLLECQPGPSSAPDRPMSHSSENLDEILVDTSTDSDFHNFRKIRTTIDSGRATALKATDFQVCVTIIEARQLAGLNMDPVVCIQVGEIRKYTSVKGSTNCPFYNEYFVFDFHMPPVMLFDKIITLSREDVLPDISDEDPPR-
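
Consistency check (sketch):

The transcript and protein lengths listed above should agree: 406 residues plus one stop codon give 3 x 407 = 1221 bp. The following is a minimal Python sketch of that check for FASTA records like the two above. The function names `read_fasta` and `cds_matches_protein` are hypothetical helpers, not part of any database tooling, and the sketch assumes that the trailing "-" in the protein record marks the stop codon and that the transcript is a complete coding sequence with no untranslated regions.

```python
def read_fasta(text: str) -> dict[str, str]:
    """Parse FASTA text into {record_id: sequence} (hypothetical helper)."""
    records: dict[str, str] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # The record ID is the first token after '>'.
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def cds_matches_protein(cds: str, protein: str) -> bool:
    """True if the CDS length equals 3 * (residues + 1 stop codon).

    Assumption: a trailing '-' or '*' in the protein marks the stop codon.
    """
    residues = protein.rstrip("-*")
    return len(cds) == 3 * (len(residues) + 1)


# Toy input: only the first two codons of the transcript plus the stop
# codon, truncated for illustration. It is not the full record.
toy = """>toy-TA
ATGCGCTGA

>toy-PA
MR-
"""
toy_records = read_fasta(toy)
toy_ok = cds_matches_protein(toy_records["toy-TA"], toy_records["toy-PA"])
```

To check the full record, pass the two sequences above to `cds_matches_protein` in place of the toy strings.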