Monarch geneset OGS2.0

DPOGS204534
TranscriptDPOGS204534-TA1806 bp
ProteinDPOGS204534-PA601 aa
Genomic positionDPSCF300297 - 237064-243733
RNAseq coverage15x (Rank: top 81%)
Annotation
HeliconiusHMEL0165773e-4782.79% 
BombyxBGIBMGA002499-TA5e-3430.64% 
DrosophilaKul-PA1e-3731.08% 
EBI UniRef50UniRef50_Q9VAI22e-3531.08%Kuzbanian-like n=12 Tax=Drosophila RepID=Q9VAI2_DROME
NCBI RefSeqXP_002098551.13e-3631.08%GE23876 [Drosophila yakuba]
NCBI nr blastpgi|448942235e-3528.41%ADAM metalloprotease CG1964 [Drosophila melanogaster]
NCBI nr blastxgi|1951124454e-3828.35%GI10418 [Drosophila mojavensis]
Group
Gene OntologyGO:00065089.2e-07proteolysis
GO:00042229.2e-07metalloendopeptidase activity
KEGG pathwaydya:Dyak_GE238768e-36 
 K06704 (ADAM10)maps-> Alzheimer's disease
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[184-412] IPR0240793.9e-30Metallopeptidase, catalytic domain
[347-410] IPR0015909.2e-07Peptidase M12B, ADAM/reprolysin
Orthology groupMCL19877 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204534-TA
ATGTTTTGGCTTTTTATTTTCTTTACATCCGTAAGAGGCTGGGACTTCCAGGACCAATTGGAGAGAGCCGACAGCATATGCAGGTCCTACGGTTACCAGGTTGTAGAAGGGATATTCGGCAAGGAAACTTATCTGTTATGTCTCAAGACTCTTGCTATTGGTACCTCACCACTTATGAGTCAACAGACAACATTGCTGTTGGATCCTGATATGGATGATAAACTAGTTCCCAAGTTTTCTCCGGTATTAGCGGATATAGTAATGTCGAGGGGGTTCCCCGTGGATGATCCGGCAGCCGGTTCAGCTGAGGGTTACGTAGGAGAGGACAATCATTTTTACGGAGTCTTGCATCTAGGTAACCGAACATTGCACGCCGATCCGTATGGTAAAGACGGTCCCGATAAATCGTTCGGGGTTCTCATTAACGACAACATGGCGGCACCTAAAAGACCACGCCGCGACCATGCACAGGTCGGAGCCGGTTTCGACAAGCTCTCATACTTCCCATACCGCATAGAAGACAGAGCCCCGGAAACCGGCATCGCTAGAGCTCGTATCTGCGATCTGCTTCTGCTAGCTGATAAGGAATTCTACGAAAAGGACTCTAATTCGAGTATAAATCACGTGGTGCAAAGGATGATGCATGCCGTGGCCCAAGCTGATATTATATTTAGGAACTCTGATTTCAACGAGGACGGCGTACCAGATAACATTGGTTTTGCAGTAAAGTATATAGTAATTCTAACGTCTGATGAGACGAACAACCGTGTCTTCGGTGATCTGGTTAAAGACCGGTCTATAGACGGGCGGAATTATCTCATGCGGTTCGCGAGGCTCAGGAGACTGTCTGAAGTCTGCCTCGGGGTCGCCTTCTCTGGACATGCCTTCCTCAACAGGACCTTGGGATTGAGTTTCACGTCGTTGGGCGGAGGGCTAGGTGGCGCTGCGGGCGGTCTGTGTGACCGACGCGCCTACGGACGCTCCTTCAACACGCTCGCCCTCGCGCACGCCACCGGCGAGCGCGACCGAGTACCCGAGAGGCTCGCGGCGCTCACACTGGCGCACGAGATGGGTCATAGTTTCGGAGCCCACCACGACGACAATTTCCCAAACCCTGACTGCCGCGGTTATCTGATGGGGTCACAGTCAACCCCCACCAAACACTCGGAGTTCTCTGTCTGTAGTAAGAGACTCATTACGGCTACCCTCAGCAGCATGAGCTATTGTCTCACGGAAGTGGACCAGCCGTATTGTGGGAATGGTATAGTAGAGATCGGTGAGGCTTGTGACTGCGGTCTACCCTCCGAATGTAGTCAGAGGGACCCGTGCTGCACCCCGCGGGCCGGAGGAGCACTCGTGTATGAAGAGGGGACCTTATATAAAGAGGGCTGCTCCGTATCACCTGGAGTCAGTTGTCACCCCTCCCAGGGTCTCTGCTGTAACGCTAACTGCGAGTTCGCCAATCTTACGAGCAGTGGAATCGAGTGTCATCACCAGCACCACGAGTGTACGTGTGCGGATTTATCGTCCTGTGACTGTGGTGTGGGCGGGCGATGTCTCCTGGACGGCACCTGTCACGCCGCAGACTGCGCCGGGCTCGGTTTAAAGGAGTGCAAGTGCCCCAAATCCGGACCGGGCGGTACATTAAAAAAATATAGAAAGTGCGGAGTGTGTTGCCAGTTCACAAAATCCGGTGTAACGAAATGTCAAGGCGTTGAGTTTGCGGCGAGGGAATTGATCGCCGAATCAGCGCTGCCACCCTCGCTTTTGCCGAACAACACTTACAAAGGATGTGGGTGTCGATAA

Protein sequence:

>DPOGS204534-PA
MFWLFIFFTSVRGWDFQDQLERADSICRSYGYQVVEGIFGKETYLLCLKTLAIGTSPLMSQQTTLLLDPDMDDKLVPKFSPVLADIVMSRGFPVDDPAAGSAEGYVGEDNHFYGVLHLGNRTLHADPYGKDGPDKSFGVLINDNMAAPKRPRRDHAQVGAGFDKLSYFPYRIEDRAPETGIARARICDLLLLADKEFYEKDSNSSINHVVQRMMHAVAQADIIFRNSDFNEDGVPDNIGFAVKYIVILTSDETNNRVFGDLVKDRSIDGRNYLMRFARLRRLSEVCLGVAFSGHAFLNRTLGLSFTSLGGGLGGAAGGLCDRRAYGRSFNTLALAHATGERDRVPERLAALTLAHEMGHSFGAHHDDNFPNPDCRGYLMGSQSTPTKHSEFSVCSKRLITATLSSMSYCLTEVDQPYCGNGIVEIGEACDCGLPSECSQRDPCCTPRAGGALVYEEGTLYKEGCSVSPGVSCHPSQGLCCNANCEFANLTSSGIECHHQHHECTCADLSSCDCGVGGRCLLDGTCHAADCAGLGLKECKCPKSGPGGTLKKYRKCGVCCQFTKSGVTKCQGVEFAARELIAESALPPSLLPNNTYKGCGCR-