Monarch geneset OGS2.0

DPOGS210373
TranscriptDPOGS210373-TA1275 bp
ProteinDPOGS210373-PA424 aa
Genomic positionDPSCF300025 + 664826-667273
RNAseq coverage1x (Rank: top 94%)
Annotation
HeliconiusHMEL0138560.083.25% 
BombyxBGIBMGA011937-TA0.073.56% 
DrosophilaCG3085-PA3e-6634.11% 
EBI UniRef50UniRef50_Q9W1V25e-6434.11%CG3085 n=29 Tax=Endopterygota RepID=Q9W1V2_DROME
NCBI RefSeqXP_317195.17e-7035.53%AGAP008275-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|312225741e-6835.53%AGAP008275-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|312225745e-7435.65%AGAP008275-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00002267.5e-78microtubule cytoskeleton organization
GO:00058747.5e-78microtubule
KEGG pathway 
InterPro domain[20-399] IPR0004357.5e-78Tektin
Orthology groupMCL25903 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210373-TA
ATGGACTCGTCTGTTCCAGTTTATGAAAAGCCTCATCCGTCTTTAACCGTTTTAGACTGGACAAGAAATATACAAAGATTGCAACATGAGGCAAGATGTCGTCGATTTGAATCATATGAACTTAGACAGACAGCTAATCAACTCCGTAATGAAACTTCTGTAACTACACGATGGGATAATTATGTTAACAATGAGTTAATGAGAGATAGAATATTCGAAGTTACGTCGTTTAGAGAAAAGCAAAGATTTACAAAAGAGCAAGTTAGAGATGAGACGAGAGCACTAAAAGAAGAAAAGAATAACACTGAATTGTATTTAGAATCTCTACAAATACCGCTTATGGTTGTGTCACAGTGTCTCACTAACAGGGACACGAGAGTTCCCTCTGAACTAACCAGGGATCAGCTGGGAGAGGAATTAAGAAAGGAACTACACATTATCGAAAACAACAAGCGTGTTCTAACAGACGTCTGTCACATGGGTTGGGAGAAGATAAGGGACCTCAACAACGTGTTCTGTAGACTTGAGAGAGAGATTCAGAATAAGGATGACGCATTGGAACTTGAGTACCAAGTTAAGGATCTTAACAGAGACTGTTCTGATATATCGTACAAGATGGACCCAACAAGGATACCCCCGGATACAATGACAGAGGAGAGCTATGTCCACTGCATAGAGAATACTATACAAATAGCGAATCAACTAATGAGGGAGTCCAAGACCTTACGAGAGACTATGTTCAGGTCCTGTGAACAGGCCCGCAATCAGATGTACGACCAGAGCCAGCGAGTGGAGGCGGTCATGAGAAGACGAGTGTATGACGTGCAGAGAGCGCGGAACGAAATGGAATGGCAAAAATATAAGTTGGAAACAAACCTTCAAAAACAAGATCGCGAAATAGAGGCCCTACGAGCCGCTGTCGCAGATAAAATAAATCCAACAAAATTAGTACACACGAGACTTGAAATACGCACTAGACGACCTGTTTTGGAGAGAGTGGAGGATAAACCCATGAGAGGTTTGATAGAAGAAACAGAAAGAGTCCAAACCAGCTCTAAGTTATTGGAACAGAAATTGGAAGATTCTTTATCTATATACCACGGAATGTATAATCATTACGAACGACTAATTAAAGATCTTCAATATAAGAATCAAGCGCTGGAAACAGACCGACGGTTGATCGAAATAAGAAAACCGCTACACAAAGACAACGAAACTGACTTCAAACGAAACGTCGGTTACTGTCACATGTCAGATGAATTGGTCACTGATTAA

Protein sequence:

>DPOGS210373-PA
MDSSVPVYEKPHPSLTVLDWTRNIQRLQHEARCRRFESYELRQTANQLRNETSVTTRWDNYVNNELMRDRIFEVTSFREKQRFTKEQVRDETRALKEEKNNTELYLESLQIPLMVVSQCLTNRDTRVPSELTRDQLGEELRKELHIIENNKRVLTDVCHMGWEKIRDLNNVFCRLEREIQNKDDALELEYQVKDLNRDCSDISYKMDPTRIPPDTMTEESYVHCIENTIQIANQLMRESKTLRETMFRSCEQARNQMYDQSQRVEAVMRRRVYDVQRARNEMEWQKYKLETNLQKQDREIEALRAAVADKINPTKLVHTRLEIRTRRPVLERVEDKPMRGLIEETERVQTSSKLLEQKLEDSLSIYHGMYNHYERLIKDLQYKNQALETDRRLIEIRKPLHKDNETDFKRNVGYCHMSDELVTD-