Monarch geneset OGS2.0

DPOGS210210
TranscriptDPOGS210210-TA1827 bp
ProteinDPOGS210210-PA608 aa
Genomic positionDPSCF300196 - 763372-768690
RNAseq coverage188x (Rank: top 48%)
Annotation
HeliconiusHMEL0091521e-11080.44% 
BombyxBGIBMGA002532-TA2e-7753.80% 
DrosophilanimA-PE1e-2233.13% 
EBI UniRef50UniRef50_D6X4K89e-2129.46%Nimrod A n=2 Tax=Tribolium castaneum RepID=D6X4K8_TRICA
NCBI RefSeqNP_609691.52e-2133.13%nimrod A [Drosophila melanogaster]
NCBI nr blastpgi|2700013003e-2029.46%nimrod A [Tribolium castaneum]
NCBI nr blastxgi|2420048911e-2535.33%conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene OntologyGO:00055158.9e-05protein binding
KEGG pathwaynve:NEMVE_v1g1177691e-06 
 K01125 (NAGPA)maps-> Lysosome
Orthology groupMCL26718 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210210-TA
ATGGTGTCAAAATGGAGCTGGGGTGGACAAACCATATGGTGCGTGGTGGTGGGACTAGTGTTGGAAGTGACAGCTCTCAGTGGACCTAATGTGTGCATGGTGCAACAGAAATATAATATAACCAAACGCGTCAAATACCGCGCGCCCATGTCCGTTCGGACTTACGAGTGGTGTTTTGCGATGCCGCCAAGGTGTTCCAAGTGGAGTACCGAGATGAGAGAGCTCACCAGACTCGAGACCGGTGAACGTATAGCAGAAGTGGCGGTGTGTTGTCCCGGGTACAAAATGAGGGATGTTACCTGTGTACCTATATGTCCCAGCGGGAAAATGGGAAACAATTGCTCTGAGGACTGTCCCGCAGACAAATGGGGTCCAAACTGTGCCAAGGATTGCGGTGAGTGCAACAACGGATACTGCGACTCGATCACAGGAGAGTGTGAATGTGACGAGGGATGGCAAGGTGAGAGTTGTCAAGTCCGCATGCCAACAACATCTCCGCCCGCCTTGATGGAAAAATTACTAACAACCACTAAGTCTACTGTTCAAACTCTTGCGACCTCTGTGCCCACTACTTTGACCACTATAAGTACATCCATAATGCAAATATTATCAACTTCAAAACCGTCATTATCTTCGTCGATAGTCGTAACATCGTCAACCGAAAAGACTGAAAATACTCCTAAGACGTCCACATCACAAACCACAATTACTTCAGTTGTTAATCCAACCCTTTCATATGAAAATGTTACACCTTCAACGACATTGTGGGTTGATAATAAAGAAAAATTTCTTAAAACAACAGTTACAAATTTTATCTACAATAGAACGGAAACTTCAGAGGCAGTAACTATCCCATCATCGACGACTACAGCCTTAAAAACATCTAGTACCACTACTATCAATGTCACTGAACCCAGTACAATTGAGCATATCACCACTGCACCTTCAACAATCACCTCTGTTACACCTACTGTGAGTTCTACTGTTAAACGAAATTCAACTACAACAGAACCAAAGATAAATACAGTGTTGACTGTTGAAAAAGAAATCAGAGCGGAACCTACTCCTAAAATTGAAACAACAACGAAGAATCAATCTACATCAGTTAAATTTAAGCCAAAAGAAATTTGGATAAAACCATCACAAAAAGAACCTGAGCATATCACTGCCGTGATGAGTGATAGGGAAAGAGATCATACTTCGCTAGATCTAATATCTGTGATAAGTATCGCAGGAGGAGTGATGATGGCTGTAATAACAGTAGCAGTCGTCATCGTCATGATAGAGAGGTGTAAGAGACACAGATACGATGACGTGAGGAAAATCAATGACATACGAATGCAAGTCATCATGGACAATAACGATGAGCCTCCTCCCTATGTTAGAAGTATATTCCATACACCATTACCAGAACCTCCAACTACGGATAGAAATCATTATCAACCAATATCGACCTTAGATAGAAATTTAAAACAATTCATGCGGCCTGTCGTTGTACAAGCCATCTCACCTGTAATGTTGGAGAACTTCAGAGGAATTTTAGAATGTCATTACGACCACTTACCACACACTAATCAAGATTTTGGAACCATACCTGTCCGTTGCTCTGTCGCGTCATCTATGAAGTATGATGAGAAACTTCTTAGACAACGACCTCTTTCGGTTGCGGATTACACTATCGATTCATTAAAGTGTGAGGCGAAACTGGACGTTATTGACTGTACGACATCTGAACCATTATATGCAGAAATTCCTTGCTGGAGACCTCCATCTGAACACGCTATAGAAGTTGTTAACTTGAACGGAGAAGCTGTAACGGAATTATGA

Protein sequence:

>DPOGS210210-PA
MVSKWSWGGQTIWCVVVGLVLEVTALSGPNVCMVQQKYNITKRVKYRAPMSVRTYEWCFAMPPRCSKWSTEMRELTRLETGERIAEVAVCCPGYKMRDVTCVPICPSGKMGNNCSEDCPADKWGPNCAKDCGECNNGYCDSITGECECDEGWQGESCQVRMPTTSPPALMEKLLTTTKSTVQTLATSVPTTLTTISTSIMQILSTSKPSLSSSIVVTSSTEKTENTPKTSTSQTTITSVVNPTLSYENVTPSTTLWVDNKEKFLKTTVTNFIYNRTETSEAVTIPSSTTTALKTSSTTTINVTEPSTIEHITTAPSTITSVTPTVSSTVKRNSTTTEPKINTVLTVEKEIRAEPTPKIETTTKNQSTSVKFKPKEIWIKPSQKEPEHITAVMSDRERDHTSLDLISVISIAGGVMMAVITVAVVIVMIERCKRHRYDDVRKINDIRMQVIMDNNDEPPPYVRSIFHTPLPEPPTTDRNHYQPISTLDRNLKQFMRPVVVQAISPVMLENFRGILECHYDHLPHTNQDFGTIPVRCSVASSMKYDEKLLRQRPLSVADYTIDSLKCEAKLDVIDCTTSEPLYAEIPCWRPPSEHAIEVVNLNGEAVTEL-