Monarch geneset OGS2.0

DPOGS211046
TranscriptDPOGS211046-TA1293 bp
ProteinDPOGS211046-PA430 aa
Genomic positionDPSCF300202 + 210622-234931
RNAseq coverage129x (Rank: top 56%)
Annotation
HeliconiusHMEL0029499e-6483.58% 
BombyxBGIBMGA003797-TA8e-7276.44% 
DrosophilaCG42313-PA1e-6442.41% 
EBI UniRef50UniRef50_D6WPC24e-8243.91%Putative uncharacterized protein n=12 Tax=Endopterygota RepID=D6WPC2_TRICA
NCBI RefSeqXP_968247.11e-8243.91%PREDICTED: similar to CG33515 CG33515-PA [Tribolium castaneum]
NCBI nr blastpgi|2700109492e-8143.91%hypothetical protein TcasGA2_TC016379 [Tribolium castaneum]
NCBI nr blastxgi|2700109496e-8045.09%hypothetical protein TcasGA2_TC016379 [Tribolium castaneum]
Group
KEGG pathwaydre:304476e-10 
 K06491 (NCAM)maps-> Cell adhesion molecules (CAMs)
    Prion diseases
InterPro domain[232-323] IPR0089574.1e-10Fibronectin type III domain
[155-231] IPR0137833.1e-08Immunoglobulin-like fold
Orthology groupMCL18286 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211046-TA
ATGACAGAGCCCGGGACAAAGGCTCGATCCGGATTTGCGACGAATGCGGAGAAACTTCGACAATCGACGCCTCTATGTGTTAAACCCTATCCATTCCGAACCAGCGGAGATAGGCAGCTGGCACACAACGCCACCGCTCGCGTCTTCCACAGCAACCAGAGCCTCGTGCTGCAGAAAGTAACGAGGCACAGCAGCGGACGGTACGCTTGTTCCGCACTCAACGCCGAGGGAGAGACTGTCTCTAACGAACTGCACTTCCGAGTCAAATTTTCACTATCTTCGACCTCACCCGCCGGCCTCACCTCAATCGGAGACTCGTTTCAATCTACGGAAGCGATGGCCATTATAGAGATAGTTAAATTGCTAGATTCGAATACGATAATGCTTCCTCATTTTCCCAAACCGATGGAACGACTATTGACACATGCGCCCTCGTGTCGCAGCGGCGGTGTGTCAGTGGTCGGCGCGGCGCGAGGCGAGTCCGTGGTCATCGTGTGCGAGGTGGACGCGGACCCCCCTGCAGCAGTTTTTAAGTGGAAGTTCAACAACTCCGGCGAGACTCTGGATGTGGCCGCCGACAGATACACCTCCAACGGCAGTGCTTCTAGTTTAAAATATACACCAGTAGCGGATTTAGACTACGGCACGCTCTCTTGCGCTGCATCCAATGAAGTGGGAGTCCAGGTGGCTCCCTGTGTCTTTCAAATGGTCGCCGCTGGGAAGCCACACGCACCTCGTAACTGCACCTTATGGAACCAGACGGCCGATTCAGCTGAGGTGTCTTGTGTTTCGGGTTTTGACGGAGGACTACCGCAGCACTTTTTACTTGAGGTGTACTCCGGGAACGAAGATAAGCCCAGAGTGAACCTCACAGCCGAGGAACCTGTTTGGACGGTGCGAGGGCTGGAGTGGGACGTGCGATTCAGGCTGGTGGCTGTAGCCGTCAACAGCAAGGGCCGCTCGGCGCCAGCGCGGCTCGATGATCTTCTGTTCCCCGACCCGGAGAAGAGAACCGCAACCGACAGCGGTCTGGGAGCCGGCGCGGCAAGTGCGGCGGGGGCGACGGTCGCCGGGGCGTGTGCGGCGCTCGCCCTGGCAGTAGCGGCCTGGCGCGCAGCCAGGCGACGAAGACAACCCAGGAAGCCCTCAGCCCCCTCTCTCTCACAACACAAGAATGATCACGACGATGCCGAACCCGACCTCATACCCAACAACTACTTTGCGGGCACAAACAACGGTGATGCGATCCCGAGAGGAGCCAGTTGGTCCGCCAGGGCTGTTCACTTAACTTAA

Protein sequence:

>DPOGS211046-PA
MTEPGTKARSGFATNAEKLRQSTPLCVKPYPFRTSGDRQLAHNATARVFHSNQSLVLQKVTRHSSGRYACSALNAEGETVSNELHFRVKFSLSSTSPAGLTSIGDSFQSTEAMAIIEIVKLLDSNTIMLPHFPKPMERLLTHAPSCRSGGVSVVGAARGESVVIVCEVDADPPAAVFKWKFNNSGETLDVAADRYTSNGSASSLKYTPVADLDYGTLSCAASNEVGVQVAPCVFQMVAAGKPHAPRNCTLWNQTADSAEVSCVSGFDGGLPQHFLLEVYSGNEDKPRVNLTAEEPVWTVRGLEWDVRFRLVAVAVNSKGRSAPARLDDLLFPDPEKRTATDSGLGAGAASAAGATVAGACAALALAVAAWRAARRRRQPRKPSAPSLSQHKNDHDDAEPDLIPNNYFAGTNNGDAIPRGASWSARAVHLT-