Monarch geneset OGS2.0

DPOGS216055
TranscriptDPOGS216055-TA2985 bp
ProteinDPOGS216055-PA994 aa
Genomic positionDPSCF300067 + 138334-147435
RNAseq coverage5998x (Rank: top 2%)
Annotation
HeliconiusHMEL0150540.061.69% 
BombyxBGIBMGA008860-TA0.050.79% 
DrosophilaTpnC25D-PA3e-7082.43% 
EBI UniRef50UniRef50_Q5D0X13e-7831.35%Arylphorin hexamerin-like protein 2 n=3 Tax=Acridoidea RepID=Q5D0X1_ROMMI
NCBI RefSeqNP_001040443.15e-8097.33%troponin C 25D [Bombyx mori]
NCBI nr blastpgi|1140519769e-7997.33%troponin C 25D [Bombyx mori]
NCBI nr blastxgi|1700432017e-7731.09%arylphorin subunit alpha [Culex quinquefasciatus]
Group
Gene OntologyGO:00068101.7e-60transport
GO:00053441.7e-60oxygen transporter activity
GO:00055093.6e-21calcium ion binding
KEGG pathwayptr:4561532e-23 
 K02183 (CALM)maps-> Salivary secretion
    Olfactory transduction
    Alzheimer's disease
    Phototransduction - fly
    Glioma
    Phosphatidylinositol signaling system
    Insulin signaling pathway
    Neurotrophin signaling pathway
    Melanogenesis
    Oocyte meiosis
    Phototransduction
    GnRH signaling pathway
    Plant-pathogen interaction
    Long-term potentiation
    Gastric acid secretion
    Vascular smooth muscle contraction
    Calcium signaling pathway
InterPro domain[94-596] IPR0137881.7e-60Arthropod hemocyanin/insect LSP
[94-596] IPR0155631.7e-60Hemocyanin-related larval storage protein arylphorin-related
[412-647] IPR0147562.9e-54Immunoglobulin E-set
[412-646] IPR0052032.5e-50Hemocyanin, C-terminal
[165-411] IPR0089222.5e-36Uncharacterised domain, di-copper centre
[35-164] IPR0052041.9e-32Hemocyanin, N-terminal
[165-404] IPR0008961.4e-29Hemocyanin, copper-type
[926-992] IPR0119923.6e-21EF-hand-like domain
[967-992] IPR0182486.3e-07EF-hand
[966-994] IPR0020481e-05Calcium-binding EF-hand
Orthology groupMCL10073 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216055-TA
ATGAAGACGTTAACATTAGTGTTTATTTTAGGGCTACTAGTTCTGAGCGATGCATTTCTCATAAAAGTTCCAACTACAGCTGGTACTTCGAGAATTGCACCAGTTGGATGGGTTCATGTACAAAAGTTACTACTACCGTTATTTGAAAATGTATGCGAAGAAAGTCAAGACCCCACCGTGGTTAGATTAAATGAAGAATTTAAACTGGATCCGAAACTTGGAAATTATTTGAAACCAGATGTCATCGATAACCTTCTAAATTTGAAAGCCACGAAAGGATTATTAGAAAAGGGACAAATATTTACTGAATATAAATCTAGTCACTTAGATGAGCTCAAGGCGGTATTCGACGTTTTATATTATGCAAACGATTTCAATACTTTTTACAAAGCTGCTTGTTGGGCGAGACAAAATTTAAATTGTGGATTATTTGTTGATGCTGTATATTTGGCTCTTTTAAGTAGACGTGACACGGAAAAAATTTCTATACCACCACCATACGAACTGTTACCTAATTATTTCGTTCAAAAGGATTTTATTATTAAAGCATCTTCATTAATAAGAGGTGAAGAAATAACAACAGACACTGTTCGAAACGAAGGCAATGCTTATATTTTAGATGTTAATTACACAACAAACTTTTACGACGACAGTGACGTCAATTTGGCTTACTTCCATGAAGATGTTGGCTTAAATTCCTATTATTTCTTAACAAAACTTAAAAATTGTCAATGGCTTAACGATAATGGATCCGTTAAAAACAATGGGCAGTACATATATCATACACTTAAGCAATTAAATGCAAGATATAATTTGGAAAGATATGCCAATGGATTACCAGAATTAGTGGGAATAGATTGGGATTACCTTAACGAGTCCCCGTACGATCCTATGCTAATATATTCAAATGGAAATAGTTTCAGTGAGAGGATGTCTATTAATAACGATAATGAACTCATAAATGTACTTAAAAATATGGAAAGCAATCTTCCCGCTGTTGTTATGCACATGAGAGATAATGGGTACAATAAATCGGAAATACTCAACCACTTAATGGACATACTAGTTAAGGACGAAAGGAGTTATGAAAACTTAGCACTGCAAATTCTTAGTGATTCTGATTCAGACTCTTACTCTGTTTTAGAGCACTACATGACAACTGTAAGAGATCCTATCTTCTGGAAATTAAATAAGAAAATTATTGATATGGTAGACAATGCTTTAAGTATTTTGCCAACCTATACAAGAAACCAACTTTATATCCCCGGAGTTGAAGTACAAAACGTAGAGGTCAAAAAAATGATAACATCATTTGACAATTTTGAATTTGATGTCACTGATGCTTTAAAAACGGAGAGTGATGAAATCAAGTTTCACGTTAAGATGTCTCAAAACAGATTGAATCACAAACCTTTTACATTTAAAATTAATGTCGCATCCTTAGTAGCTCAAAAAGCACTCATTAAAATATTTATCGGTCCTAAGGTAATGCCGGGTGAACTAGCTAAGAAGAAGAACCTCTTTATGTTATTAGACTGCTTTGAGACTAATTTGAAAATAGGTAGTAATGTAATAACGAGAGCATCAAACGAAATGACGGATTTTTCCGAAGACCTTATCTCACAAGATATTGTATATAAAAAAGTGTTGGATACCGAATTTGGAATAGATGCGTTGCCTCTAAAATCAGTTACATCGCAAATCAAGTTCCCTTCTCGATTAGCCCTTCCTAAAGGTACTAGCACTGGTTTACCACTGCAAGTATTTGTTTTTATAGCTCCATATATAAAAGCGAATGTTGGTGGATCAAGAGCTAATGTCGAATTAAATTCCGATGCAATATTTTCTCCTGGATACCCTCTTGATTTAGACATAGAAATACAAGAGTTATTTAATTTACCAAATGCTTTAGTGAAAGACATCATTATAACACACAAGAGTGAAAGTCTTAATACCGACAGTAAGCTTGGTAATAATTACAAAGATGGTGTCAACACTAGACCATGGCAAAAGGATGAAATAGATTATGTAAATAATCAAAATAGCCTGTTGGCTTTGGCTGAGCGACCTGCATTTACTAAGAAAACATTTGATTATAAATCTAAAAAGAACCAATATGGAAAACGGCCAGACTATTCTAATAAATACTCAGAAAAAGACAAGGAAATTAAGACTAATGAACAAAATCTAGATAAAAATGATGCGATATCTGAAGTGAATCCTGAACCCGAAAACATGGAGATTATAAATAACATTTACATTACTCCTGAAATAAGCCGAAAGGTTGAATTGGACACAGACGAAAGTCAAGGAAATGGTTATAAGTTTTTATTAGATAAAGATGAAATGTATAATGTAATACATAAAGAAGTAAATGAAGACTACAATATGAAAAATAAATTCATCCCAAGAAACGAAAAGGATGTATCCTATAAAGTAGTTGTAGACAAAGATGACCTTAATAATATTTTACATAAAGAAGTAAATGAAGAAGTTAAAAGTAAGTTAAACAAATATGAAAAAGATTCAAAGGACACTGATGACGACCAGAAGATGGCTATGCTCCGCAAGGCATTCCAAATGTTTGACACCACAAAATCAGGATACATCGATGTCCTAAAAATCTCAACTATCCTGAACACCATGGGACAACTTTTCGATGATTCTGAATTACAAGCCCTCATCGATGAAAATGATCCAGAGAGCACTGGCAAGATTAACTTCGATGGATTCTGCAACATTGCCTCTCACTTCTTAGAGGAGGAGGATGCTGAAGCTATGCAACAAGAACTGAAGGAAGCTTTCAGATTGTACGATCGTGAAGGAAACGGTTACATCACCACTTCAACTCTCAAAGAGATCCTTGCAGCTCTGGACGACAAACTTAGCAATGCTGATTTGGATGGTATCATTGCTGAGATTGACACGGACGGATCTGGCACCGTTGACTTCGATGAATTCATGGAGATGATGACGGGAGACTAA

Protein sequence:

>DPOGS216055-PA
MKTLTLVFILGLLVLSDAFLIKVPTTAGTSRIAPVGWVHVQKLLLPLFENVCEESQDPTVVRLNEEFKLDPKLGNYLKPDVIDNLLNLKATKGLLEKGQIFTEYKSSHLDELKAVFDVLYYANDFNTFYKAACWARQNLNCGLFVDAVYLALLSRRDTEKISIPPPYELLPNYFVQKDFIIKASSLIRGEEITTDTVRNEGNAYILDVNYTTNFYDDSDVNLAYFHEDVGLNSYYFLTKLKNCQWLNDNGSVKNNGQYIYHTLKQLNARYNLERYANGLPELVGIDWDYLNESPYDPMLIYSNGNSFSERMSINNDNELINVLKNMESNLPAVVMHMRDNGYNKSEILNHLMDILVKDERSYENLALQILSDSDSDSYSVLEHYMTTVRDPIFWKLNKKIIDMVDNALSILPTYTRNQLYIPGVEVQNVEVKKMITSFDNFEFDVTDALKTESDEIKFHVKMSQNRLNHKPFTFKINVASLVAQKALIKIFIGPKVMPGELAKKKNLFMLLDCFETNLKIGSNVITRASNEMTDFSEDLISQDIVYKKVLDTEFGIDALPLKSVTSQIKFPSRLALPKGTSTGLPLQVFVFIAPYIKANVGGSRANVELNSDAIFSPGYPLDLDIEIQELFNLPNALVKDIIITHKSESLNTDSKLGNNYKDGVNTRPWQKDEIDYVNNQNSLLALAERPAFTKKTFDYKSKKNQYGKRPDYSNKYSEKDKEIKTNEQNLDKNDAISEVNPEPENMEIINNIYITPEISRKVELDTDESQGNGYKFLLDKDEMYNVIHKEVNEDYNMKNKFIPRNEKDVSYKVVVDKDDLNNILHKEVNEEVKSKLNKYEKDSKDTDDDQKMAMLRKAFQMFDTTKSGYIDVLKISTILNTMGQLFDDSELQALIDENDPESTGKINFDGFCNIASHFLEEEDAEAMQQELKEAFRLYDREGNGYITTSTLKEILAALDDKLSNADLDGIIAEIDTDGSGTVDFDEFMEMMTGD-