Monarch geneset OGS2.0

DPOGS208354
TranscriptDPOGS208354-TA4989 bp
ProteinDPOGS208354-PA1662 aa
Genomic positionDPSCF300251 - 27835-41271
RNAseq coverage139x (Rank: top 55%)
Annotation
HeliconiusHMEL0037640.072.81% 
BombyxBGIBMGA000067-TA2e-9766.79% 
Drosophilanrm-PE1e-14340.34% 
EBI UniRef50UniRef50_E0VDC01e-16346.99%Vascular cell adhesion protein 1, putative n=1 Tax=Pediculus humanus corporis RepID=E0VDC0_PEDHC
NCBI RefSeqXP_002424114.12e-16446.99%Vascular cell adhesion protein 1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420065535e-16346.99%Vascular cell adhesion protein 1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420065534e-16946.90%Vascular cell adhesion protein 1 precursor, putative [Pediculus humanus corporis]
Group
KEGG pathwayecb:1001464023e-18 
 K06467 (CD22, SIGLEC2)maps-> Cell adhesion molecules (CAMs)
    B cell receptor signaling pathway
    Hematopoietic cell lineage
InterPro domain[428-523] IPR0137835.3e-13Immunoglobulin-like fold
[25-110] IPR0130984.2e-08Immunoglobulin I-set
Orthology groupMCL16159 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208354-TA
ATGATTGAAACGAAGGAGCATCCTATACACGCCACGAGAGAGCTTGAAGTTTGGTACCCGCCGTTAGTTCATGTAACGCCGCCGAACATTACAATCGTGGAGGGATCCAAGATTTTGTTGAAAAGCGAATATGAGAGCAATCCGTCTTCATTAATCGAAGTAATCTGGTACCGTGATGGTATGAAAGTCAATGTGAATAAAAGTCACTACCAAGGTGGCAATACCGACCAACACTCGCTCATCATCTTAGACGCCAATGGAGAAGATATGGGGAACTACACTGTCCTCCTTGCCAACGCTGTTGGGAATGGCACCACAAATGAGACTATCAGCGTCAATGTTTTATACAAACCTCAAGTACGTCTTACAATGAGTTCTCCCAGCCCAATCCTAGAAAGAGAGCACAAGAACGTGACGCTGACCTGCGAGGTGGTTTCCGGGAATCCTTCTATCCTGGATGAAGTTATTTGGTTTCTGGACGGAGAGGTTCTCAAACATCTGCCTGAATGTAATGATACCGATGCATTACTCTGCAATGAAGTAGACCCATCCATGTTACTCCTTCAAGATACAACAAAAAGTTTTCACGGAAACTACTCCTGCAAAGGAAAAAACTACGCAGGTTGGGGAAATGAGAGTGAAAAAACGGAACTTGTTGTTAACTATCCTCCCGGTCCAGCCAAATTAACTTACTCCCCATGGCAGGTTGTGAAGGGAAAGTCCCTAGTTCTTAGCTGCAAAATAGAAGAGAAGGGGCGACCAGAAACACATAGATATCGTTGGTACCGCGGCGGACGTCCTGTGACTGACATCGTCTCCCAGAATTGGACGATAGACCCTGTAACTCTTGACCATCGGACTAACTTCTCTTGTCGTGGAATCAACGCTGGCGGAGAAGGAGAGGCAGCCTTTGTCAATATAGACGTATTGGCTCCTCCTAGCTTCAAATACGCAATGAACCACTACAGCGGTGCTTTATACAAATCACAGAACATATCTCTATCATGTACTGTAGAGTGTGCTCCGCTGTGTAGTGTTGTGTGGCTCAAAGACGGACAGATTATTGACCCCGAGAAAACCGATAGATACTACGTCGAAGAACGAAAAATTGAACCTCAAGTTAATAGAAACGACTTCGAAGCTACTGAATCTACTTTGCATTGGAACATGTCAGCGTGGTCAGGAGGCATGTTGTCCCGCGGTGACTCCGCCCGGTACACGTGTCGCTCCGGAAGGAACGCTGCCGGACCACCAGTCAACAGTACCACCAAGTTTGCTGTTGAATATGAACCGGAAAATATTACAGTTACACCGGAAGTGGTGTCGGTTGTGGAAAATGAAATACCAGCGAAGGTTGTATGTTCAGCCAAAGGGTTTCCAATGCCGAGCTACAGTTGGCGTCGTCTCACTCCTCACAAGTCCTCATCTCGTGATCACAACTCCTCACTGATTCTATCATCATCGAACGCTCTCCTCCTGGGGCCGGCGGCTCGACGACACGCTGGGCGCTACGTCTGTGAAGCTTACAACAGACACGGCTTCATTAACACTAGCGTGTTACTAGATGTCATGTTTATTCCAGAATGTGGCATAAAACAGATCGAGTTGAATGGAGAGCAGGTGTTAGTGTGCACCGCCCACGCGAACCCGTCTGAGGTATCATTTACATGGAAGCTGAAGAATGACAACGACAGTTTGACAGACGAGAAGATCTGGCAGAGAGGAACACAGAGCTTCCTTCGTCTGCCAGCAGTCGAGGTCTACCGGACCTACCTCTGCAGAGCGAACAACAGCGTGGGCGCTTCCAGACCTTGCGAGAGAGATGTCATGGGTACTAAAGTGTGGTGGAGAGATCAGCACAAGCTGATGTTGATTGGCGGTGCCGCCCTGGCGTTGCTCGTTCTCTTTGTCATTCTCTGTGCCATCATCATCTGTGTTTGCAGACGGATGAGAGCCAAATCTAAATATAATAATCCAGTCGAGTTAGAAGAGCGGGAAAACACCCTTGGTCCGAGCAGATTCTTGGCACACATTGTCAATGCCACCCTCGAGGTCGCCCCTTATAGTAACGCAGTCGTTGACATAGGTCAAGGACGTACTAAGGTTCCTAACCGCCCTAACACCAGATGTTTTGAGACTGACAACAATAAACACTTCAGTCCGGATGGTAGCTGTCTAAACGAAGTGTCCATAGCGCAATCTATAATATCACAAGCTTTACCCGCTTTGTCACCGCGTATCTTATCATTCAGTCCGAAATCTTCAGATAGAACTAGACTTGTAAACTCACAGAGTTATCCTCATAGCATAGAAATGACTGTCAAAGAAAGCTCCAATAAAATAGATCTAAATATTAATCAAGACGCAACAGAAAATGTCGAGACACCGAAAACGGAACCTAAAACAGATGTAGATAAGATCCAAACTGTAGTGACTCCAAATAAATGGCCTCTTAAACCTGGAGTTTTGGTGCACGTTAACTCAAACCATACACTTAGCCCTAAAAATCAAGCGAGATTACAAAATAGAATACAGAATGAGAGTAATGAAATAAGCAATCTTGGCTTATTGGAAAGGAAACCAAATATAGATGATAAAAGCCCACCAAAGACTGAATATTCAAAGAAAACTGGAAAACCAAAAAAACATTCTCAAAAAGTAACTGGCATACTTAAAAATCTAAGACGCAAGAGTGATTCATCGGATGAAGAAAGTGGAAAATATAAAAAGATAAACAAAACAAACAAAAGAGCTGTAACACTAGTGACTCACAATAAAGGCTTTGGGCATAAGAGAAGTATTAGATCGTTCTTTCAAAGCGAAACGCCAAATGTGATAGTCACGGAAGGAGTCGTATCATTTAAAAGACCTGATAGAATAGCGAATGACAAAGTACCTATAAAAGCACCGAGAAACACAAGAAAACGGAAGAAACCCGGAGATAGCCCAGCTACGAATGGCGTAGATAACGCGCCAATCACGGAACCCGGGCTGTACGAGAACTTGCCCTTCCACGGATTACAGCAACCTCCTAATAAGCCTGTGCAAGCGATACAACCTAGAATTGTCGAAACTAACCAAAAAGCCACTAAAGGCATTCAGGCGCTACAAAAACAATTGTCAACCAACCCATCATTCACACCCCGTGTCGTTTGTCCGCCGTTCGTACAAAACGCAATCAGTTACCCGGTTACGTCGAATACTGAATTCCCAACTTACGGGATACCAGTTTTATCACCGATTCAACAAATGTACCCATTGCATCAAACTGTCTTATTAAACCCGTACTTACCACAAATGCAAACATCGTTTCTAAAACAATTTCCTAGAATAGATGAAGAAAAGTTCGAATCTGAAACAAAAAAATTTTGTTCCTTAAATACGAGGAATGTAAAAAATAAACCAAAGTTTCAATCTATGAGGATTGTTAGAAAGAGAAACATCGAAAGATTTTATCCAAATACAGAGTATGGTGAGAATTTTAACGAGATTAATAACAACATAAATGAGAATGATGATAAACTGGAATCCAGCGGTAACATCTATGAAAACTATCCCACACTATTGAAAGTTAATTCTGACATAAGCGGTGTGGAACAGTTTGAGAACTTAACTGATTCCATACAAATCTGTGAAGAAAGCAGCGATGGAATAAATGCATCAACATCTTCTTTAAATAGACCCGTACCAGCTCCCAGAACTAAACTGACGCCGGTCACATCACCTGCAAAATCTGATCATGTCTACGTTAATCTTTCAATGCCACTCATAAACACAAGTTACATGCAATCATATGACGTCACTGATGGCCCGGTCAGGCCAACTAAGTCAGAAGAAAACGTTTCCAAACATATCCAAAGAAACCAAGCCAAAATAATACCGATGGTGCCAAAAAGAACGATAAGTCTAAAAAATATTCACGAACAGTGCGATAACGATCAGAACGCGATCAATGAGGAAAAAGAATTGACGAAACCGAATATCATCGGTGCAACACCTAAAAGAAATACTTCGATAACATTACCATCAACCTCACTAGTTAAATCTGTTGTCAATCAACTAAACGATCAAGGTATGAATAAGGTTGGGCTGCCGAAGAAGCAAGTTATATTACCAAACAAAACCTTACAAATACCCAAACTGTCTGAAAAACCGATACCAAAAAGAAATTTAGTCCAACGTAGCAATTCAGTCCAAATTTGTTCTAATAATAGTGATATCGCTCAGAATACTAATATACTACAACAACAATACCTTCAGAAAAGAAACCATTTTAGCAATATATCCAAAGGAAAAAACAAGAAAAACTTCCAAATACCCTTACAAAAACACCACAGCTTTTGCTACTTCCAGCCAATTAGAGTAGATAAAAGAATTCCGGTAGATCAAGAACTATCATATAACAACACAATAGGTCCAGTCATCTACCAGCATCCAGAAACTCAGTACTACACAAAAACATTAAATAGAACAAGAGAAAAAAACAGTCAGAAACTAAATCGCCTTAAAAAATCTGAAAGCTTACGTGAGAAACTGTCATGCGTCGGCGTTTACGATAATTACGAGAAACCAAATTATCCGGAACCAGATTACAACGAATCAGAACAAAAATCATACCTCCAGTTACTCAAAGGGAATTACATATCAAACTCCACAAGAAACCTAGCACCTGAATACAATTACAACACTAACTTCAATACGATGAACCCTAAAGAGGAGAGACCTAAACATAAGAAACTAGTGTACGCAGATTTAGCGTTAGCGTACAGCAAAGAATACAATAACGATTTTGAAAGGAAAGATAACATGAAACGTCTGTTACCTTTCCTACCAGAAGATCAGTTTATTTACGCAGACGCCGATATGAAGGACTACGGGCCGATCAATTACAAAGCAGCATCAATATACGCTCAGATGAAGAAAGTCAAAGATCAGCAGAAATTACAGGAAATATACGAAAAAAGCCATCCCGAGGAGGACATGTTGTGA

Protein sequence:

>DPOGS208354-PA
MIETKEHPIHATRELEVWYPPLVHVTPPNITIVEGSKILLKSEYESNPSSLIEVIWYRDGMKVNVNKSHYQGGNTDQHSLIILDANGEDMGNYTVLLANAVGNGTTNETISVNVLYKPQVRLTMSSPSPILEREHKNVTLTCEVVSGNPSILDEVIWFLDGEVLKHLPECNDTDALLCNEVDPSMLLLQDTTKSFHGNYSCKGKNYAGWGNESEKTELVVNYPPGPAKLTYSPWQVVKGKSLVLSCKIEEKGRPETHRYRWYRGGRPVTDIVSQNWTIDPVTLDHRTNFSCRGINAGGEGEAAFVNIDVLAPPSFKYAMNHYSGALYKSQNISLSCTVECAPLCSVVWLKDGQIIDPEKTDRYYVEERKIEPQVNRNDFEATESTLHWNMSAWSGGMLSRGDSARYTCRSGRNAAGPPVNSTTKFAVEYEPENITVTPEVVSVVENEIPAKVVCSAKGFPMPSYSWRRLTPHKSSSRDHNSSLILSSSNALLLGPAARRHAGRYVCEAYNRHGFINTSVLLDVMFIPECGIKQIELNGEQVLVCTAHANPSEVSFTWKLKNDNDSLTDEKIWQRGTQSFLRLPAVEVYRTYLCRANNSVGASRPCERDVMGTKVWWRDQHKLMLIGGAALALLVLFVILCAIIICVCRRMRAKSKYNNPVELEERENTLGPSRFLAHIVNATLEVAPYSNAVVDIGQGRTKVPNRPNTRCFETDNNKHFSPDGSCLNEVSIAQSIISQALPALSPRILSFSPKSSDRTRLVNSQSYPHSIEMTVKESSNKIDLNINQDATENVETPKTEPKTDVDKIQTVVTPNKWPLKPGVLVHVNSNHTLSPKNQARLQNRIQNESNEISNLGLLERKPNIDDKSPPKTEYSKKTGKPKKHSQKVTGILKNLRRKSDSSDEESGKYKKINKTNKRAVTLVTHNKGFGHKRSIRSFFQSETPNVIVTEGVVSFKRPDRIANDKVPIKAPRNTRKRKKPGDSPATNGVDNAPITEPGLYENLPFHGLQQPPNKPVQAIQPRIVETNQKATKGIQALQKQLSTNPSFTPRVVCPPFVQNAISYPVTSNTEFPTYGIPVLSPIQQMYPLHQTVLLNPYLPQMQTSFLKQFPRIDEEKFESETKKFCSLNTRNVKNKPKFQSMRIVRKRNIERFYPNTEYGENFNEINNNINENDDKLESSGNIYENYPTLLKVNSDISGVEQFENLTDSIQICEESSDGINASTSSLNRPVPAPRTKLTPVTSPAKSDHVYVNLSMPLINTSYMQSYDVTDGPVRPTKSEENVSKHIQRNQAKIIPMVPKRTISLKNIHEQCDNDQNAINEEKELTKPNIIGATPKRNTSITLPSTSLVKSVVNQLNDQGMNKVGLPKKQVILPNKTLQIPKLSEKPIPKRNLVQRSNSVQICSNNSDIAQNTNILQQQYLQKRNHFSNISKGKNKKNFQIPLQKHHSFCYFQPIRVDKRIPVDQELSYNNTIGPVIYQHPETQYYTKTLNRTREKNSQKLNRLKKSESLREKLSCVGVYDNYEKPNYPEPDYNESEQKSYLQLLKGNYISNSTRNLAPEYNYNTNFNTMNPKEERPKHKKLVYADLALAYSKEYNNDFERKDNMKRLLPFLPEDQFIYADADMKDYGPINYKAASIYAQMKKVKDQQKLQEIYEKSHPEEDML-