Monarch geneset OGS2.0

DPOGS203409
TranscriptDPOGS203409-TA1656 bp
ProteinDPOGS203409-PA551 aa
Genomic positionDPSCF300003 + 1318923-1329754
RNAseq coverage411x (Rank: top 29%)
Annotation
HeliconiusHMEL0063790.072.82% 
BombyxBGIBMGA012346-TA7e-14971.21% 
Drosophilab6-PA2e-6835.44% 
EBI UniRef50UniRef50_E2B5G84e-9443.11%Neuronal pentraxin-2 n=6 Tax=Formicidae RepID=E2B5G8_HARSA
NCBI RefSeqXP_001810992.12e-10143.85%PREDICTED: similar to GA15926-PA [Tribolium castaneum]
NCBI nr blastpgi|1892379354e-10043.85%PREDICTED: similar to GA15926-PA [Tribolium castaneum]
NCBI nr blastxgi|1892379351e-11845.26%PREDICTED: similar to GA15926-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[89-255] IPR0133201.3e-29Concanavalin A-like lectin/glucanase, subgroup
[90-260] IPR0089852.3e-22Concanavalin A-like lectin/glucanase
[90-252] IPR0017594.5e-16Pentaxin
Orthology groupMCL14859 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203409-TA
ATGCGGGAACTATTTCTGTGTGCTTTGCTGGGATTAGCAACTGCCAGAGAGAATGACTGGCAGCCAATTGTTCCGGAATCGCCCCTACTCTATACAAGCTTTTCGAAACCGGAAAGAGCTTTAGATCCTGGTTTGTTAAAGATCGTGAACCGAAGGCCCGTCCTACGACCGACTTATAGCAGCAAGCCTAACCCTTATAGCTCGGCGCCAGCACCGCCAACTGATTCTTGCAGCGTGTATAAAGTTTCCATGAACCAAGAGTTATTCTACCAGTATGTAGAGTATGAGGCAAATCTGCCCGATTTAAAAGAATTTACCTTGTGTATGTGGAGCAAGTTTCATAATCATAGCGAAGATCATCCACTATTCTCATATTCTGTGGGATCTAATCCAAAGGAAATATCGTCCTGGATATCAAATACAAAGGAGGCTAGTTATTTCAGTATGGCGGTGCACGGACAGACGTTCTTTAGGCTAAATTACCCTCTGAAGCTTAACACTTGGTATCACTCTTGCCAGTCATGGAACGGTAAAACAGGAGAATGGCAGGTGTGGGTGAATGCTGAGAGAGTAGGCAGAGGTTTTCATAATCGACTTGTTGGTCATGTAATCAAAGGAGGCGGAACTAGTATAACTGGACAAGAACAATCTCTTCTGTACAAAAAAGATGGCGCCCAACCTATTATAAAACAATCCGGATTCATAGGAGAAGTTACTATGCTTCAATTGTACCATGTCGCTTTGACTGCGGGAAAAGCCCACAGGGATCACAAACATCACCACGTTCATCACTTTAAGCACGACGGAACTCCTTTGGAGAGTACATCTGAGCCAGCCACTGAAGCACCTGAACCTCCAGCGATTCCGATTGGTAACGGTAACTTTTTAATCGGAGGTCAATTGCAAAGGCCCGCTGACCTAAATTTGGCCCAACCTCAACAAATGGTGCCCGCTAAACTGGCCAATGGTCTACAGTTTCAACAAGAGTATGTCAATGGACAACTAGGAAACAGAATAGTTTCAGAGCAACTACTGAACAGTGTGCAATCTTTGCAAACAAACCAAATTTATGGAGCTAGTAATAAACAGTCATCGTTCCTCGCCTTACCAATATCAACATCAGTGGGTCAAAGTCAACGAAATCCAACAAGAACCTTTGGGCCAGGAAAGGGTACTACGGTTCTTCTGTCAGGATCTCTCATTAATCCTGCTAATGTTCAATATATCGACGATATAAATAATTCGCATAATCTATACAAAAGGGATTCTAAAAAGCGCGACAAAAGAGATAATCCAGAAAAAGAAGTATCTGATGATTTAAGGAAGGGAAAAAAAGACAAACGAGGACTCGTTTCTCTGTCAGACGGATCCATAGTCGATGAAGCTCTACTGAGTCCAGATCTTTTTGATACCAAAGAAGACGAAATTCTCTTCCAATTATCACTTCAGAATGGCTTAGCTGGTGTTGTAGGAAATCAACCGGTCGATGAAAGAGAACCCGCAGAAGCAGAAGTTAAAGCAGTAATGGAAGTATGCAGTGGTTGTACTCCAGAACCGTTCAAGAAAGCGCTTATTTTGTCCTGGAGAAATACACCGAAGAAACTTTACAGTGGGGCACATTATTACAAAGGGTTACCAATTTGTCGAGCATTCTAA

Protein sequence:

>DPOGS203409-PA
MRELFLCALLGLATARENDWQPIVPESPLLYTSFSKPERALDPGLLKIVNRRPVLRPTYSSKPNPYSSAPAPPTDSCSVYKVSMNQELFYQYVEYEANLPDLKEFTLCMWSKFHNHSEDHPLFSYSVGSNPKEISSWISNTKEASYFSMAVHGQTFFRLNYPLKLNTWYHSCQSWNGKTGEWQVWVNAERVGRGFHNRLVGHVIKGGGTSITGQEQSLLYKKDGAQPIIKQSGFIGEVTMLQLYHVALTAGKAHRDHKHHHVHHFKHDGTPLESTSEPATEAPEPPAIPIGNGNFLIGGQLQRPADLNLAQPQQMVPAKLANGLQFQQEYVNGQLGNRIVSEQLLNSVQSLQTNQIYGASNKQSSFLALPISTSVGQSQRNPTRTFGPGKGTTVLLSGSLINPANVQYIDDINNSHNLYKRDSKKRDKRDNPEKEVSDDLRKGKKDKRGLVSLSDGSIVDEALLSPDLFDTKEDEILFQLSLQNGLAGVVGNQPVDEREPAEAEVKAVMEVCSGCTPEPFKKALILSWRNTPKKLYSGAHYYKGLPICRAF-