Monarch geneset OGS2.0

DPOGS201986
TranscriptDPOGS201986-TA1518 bp
ProteinDPOGS201986-PA505 aa
Genomic positionDPSCF300060 + 16997-22139
RNAseq coverage54x (Rank: top 69%)
Annotation
HeliconiusHMEL0040386e-9653.21% 
BombyxBGIBMGA010554-TA6e-2525.37% 
DrosophilaCG15630-PA2e-1921.78% 
EBI UniRef50UniRef50_C3YY391e-2127.09%Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YY39_BRAFL
NCBI RefSeqXP_002003379.16e-2022.44%GI17882 [Drosophila mojavensis]
NCBI nr blastpgi|2608083815e-2127.09%hypothetical protein BRAFLDRAFT_122460 [Branchiostoma floridae]
NCBI nr blastxgi|2608083812e-2026.88%hypothetical protein BRAFLDRAFT_122460 [Branchiostoma floridae]
Group
KEGG pathwaybfo:BRAFLDRAFT_1224608e-22 
 K06491 (NCAM)maps-> Cell adhesion molecules (CAMs)
    Prion diseases
InterPro domain[261-331] IPR0137835.7e-21Immunoglobulin-like fold
[138-236] IPR0035999.4e-12Immunoglobulin subtype
[133-216] IPR0130981.2e-11Immunoglobulin I-set
[144-218] IPR0035984.6e-10Immunoglobulin subtype 2
[320-470] IPR0089575.1e-10Fibronectin type III domain
Orthology groupMCL34427 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201986-TA
ATGATTTGTCAGTTTTGGAAACAAGAAGAAGTAAAAAGGAGAAAAACTGTTTATGGTTTAAAAGCTATCCGAAAAGAAGTCCAGGGCCTAACAATTGAGCCGATATTTTATGGTCCAGATAAAAATTATTTTGAAGCGGGTACTCGCAAATCAATAACATGCAAAGGTGCAAATAAAAACCAAAAGGTAGAATGGGTAGATCCATCAGGCAAAGTAGTGCAACGGTTAGCAACCAACAGGGTTTTCACTCAGGAGCATTTCGTATCTACCTTCAGGTCTAGAGTACCAGCAATGGTGCTAATTCTTACCAAAGCGACAGTGGAAGATACCGGAGTGTGGCAATGTAGATCAGGAGATTTTAGGCAAAACGTCTCCTTATGTATTATAGAGCCATCAGAGTTTCTGGAGACTCCAACAGAGGTATCTGTGGATCGTGGCAGATCCATAACCCTGTCCTGTCAAGCGAAAGGGGAGCCCGAACCGCGCTTGGTATGGTACAGACACGACACTATCATTAGCGATGATTATAATCCATCGAAGTACCAAATAATGACCAAATACAACAGTGAGGGTTTTGAAGGTTTGCTGACCATCACGTCTTTGGAAGGTGAAGACAGTGGTGTATACAATTGCTTCGCAATCCAAGAAAGCTCTTACGTAGATGGATGCAGCGCGAGCATCAGTATGAATATCACATTACATGTTAACTATGCACCAACATTTAGCGACGGCAATGATACAACGTTGGTTCCTGTGCAAGAAAAAAAGAGTACAGACTTAGAATGTGTCGCCGAAGGATACCCAACTCCCACCTATAGATGGTTTAAGGAAGTCGGAGATATACTGTCAGAATTTCCTTCAAGCGATATAAAACTAGAAGACGACGGTGAAAAAGCTATTCTTAGTATAACTGCTGATGAATCTACATTTGGGCAAAGATACAAATGTCGCGCTAGTAACAAATATGGCAGTGCAGAGAAATCGTTTGCTGTTATTAAACTAGAAAAACCAACTAGGCCAACTGAGATCGTTAGTAGGGATCATGATCATGACGCATTGAATTTTATTGCGCAGTGGGATGAGGAAATATATTTTCCAGTTGAAGAATTCCAAATACAATATATTGAATCAAAATTACTGAGAAAAAAATCGGGCCAACCAAGGGAAGTGGACTGGAAAAGATCTGAAGAGGTTCTGGTAAAAAATAATGAATTTGGTGAAATGGAAAGCGGAGGAACCGTTATGCTCATTAAATTAGACGACTTAAAAGAAGAAACGGAATACTGGGTCCGTTTCAAAGCCGTTAATGACGCTGGAGAATCTGCCTGGTCGGAGCCTATATTAGCTTCAACTGTCGCTAAGCCTGAAGAAGAAATCGAAATACCAGAGGAAGGTGAAGAAACTAGTGAGCCCAAAGCTGATGCCCAAGTATCGAATGGTACGTTCTATGGTGTATTCTTTGCTGGTGGTATAATAGTTGTTATTTTAGGTGCAACGTTTTTAATTAGAATGGTTTAA

Protein sequence:

>DPOGS201986-PA
MICQFWKQEEVKRRKTVYGLKAIRKEVQGLTIEPIFYGPDKNYFEAGTRKSITCKGANKNQKVEWVDPSGKVVQRLATNRVFTQEHFVSTFRSRVPAMVLILTKATVEDTGVWQCRSGDFRQNVSLCIIEPSEFLETPTEVSVDRGRSITLSCQAKGEPEPRLVWYRHDTIISDDYNPSKYQIMTKYNSEGFEGLLTITSLEGEDSGVYNCFAIQESSYVDGCSASISMNITLHVNYAPTFSDGNDTTLVPVQEKKSTDLECVAEGYPTPTYRWFKEVGDILSEFPSSDIKLEDDGEKAILSITADESTFGQRYKCRASNKYGSAEKSFAVIKLEKPTRPTEIVSRDHDHDALNFIAQWDEEIYFPVEEFQIQYIESKLLRKKSGQPREVDWKRSEEVLVKNNEFGEMESGGTVMLIKLDDLKEETEYWVRFKAVNDAGESAWSEPILASTVAKPEEEIEIPEEGEETSEPKADAQVSNGTFYGVFFAGGIIVVILGATFLIRMV-