Monarch geneset OGS2.0

DPOGS208326
Transcript        DPOGS208326-TA  1026 bp
Protein           DPOGS208326-PA  341 aa
Genomic position  DPSCF300383 - 84287-87292
RNAseq coverage   650x (Rank: top 20%)
Annotation
Heliconius      HMEL013987        1e-15   88.37%
Bombyx          BGIBMGA004038-TA  5e-166  84.14%
Drosophila      Eb1-PF            8e-71   88.89%
EBI UniRef50    UniRef50_G6CLG1   0.0     100.00%  Putative uncharacterized protein n=3 Tax=Coelomata RepID=G6CLG1_DANPL
NCBI RefSeq     XP_002066049.1    4e-120  58.72%   GK22141 [Drosophila willistoni]
NCBI nr blastp  gi|332027914      3e-125  63.10%   Microtubule-associated protein RP/EB family member 1 [Acromyrmex echinatior]
NCBI nr blastx  gi|332027914      2e-124  63.10%   Microtubule-associated protein RP/EB family member 1 [Acromyrmex echinatior]
Group
Gene Ontology    GO:0005515           1.1e-64  protein binding
                 GO:0008017           6.7e-18  microtubule binding
KEGG pathway
InterPro domain  [6-131] IPR001715    1.1e-64  Calponin homology domain
                 [277-317] IPR004953  6.7e-18  EB1, C-terminal
Orthology group  MCL11606             Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208326-TA
ATGGCAGTGAATGTGTACTCCACAAATGTGACGTCGGAAAATTTATCAAGGCATGATATGTTGGCGTGGGTGAACGACTGTCTTCAGTCGAACTTCGCCAAAATCGAAGAGCTCTGCACGGGCGCCGCTTATTGCCAGTTCATGGATATGCTGTTCCCTGGCAGTGTACCAATGAAAAGAATTAAGTTCAAGACAAACTTAGAGCATGAATACATACAGAACTTTAAAATTCTTCAAGCCGCTTTCAAAAAAATGTCTGTAGATAAGGTAATACCCGTGGACAAGCTGATAAAGGGGAGGTTTCAAGATAATTTTGAGTTTTTGCAGTGGTTTAAAAAGTTTTTTGATGCAAACTATGATGGAAGGGAATATGATGCGTTTGACGCCCGTGGTGGGCTAACTATTGGTTCAGGGGCATGTGAAAGCGGGGTGCCTCTGTGTGTTTCCGCAGCGCCTCCAAGAATAGTACCCATTGACAGACTGGTGAAGGGTCGGTTTCAAGATAACTTTGAGTTCTTGCAATGGTTCAAGAAATTTTTTGATGCCAACTATGGAGGCACGGAGTACGACGCGATGGCACAGCGCGAAGGCCTGCCAATGGGTCACGGAGCCCCGGGAGCTCCGCCTAGGGCTGTTCCCGCTGCCGTTAAGAAACCGACAGCTCCTGTCGCTAAAGTCGCTGCCAGACCCCAAACCATTGGGAAGGCAAATAGCACAGTGAGGTCACCGCCAATGAATCCATCGAGATTAACTCAAAGTGGTAAAGGAGATTCTAAAGTAATAGATGACCTAAATCATCAGATAAATGAATTAAAAGCAACTGTTGACGGATTAGAAAAGGAAAGAGATTTTTACTTTGGTAAGCTTCGGGATATAGAAGTAATCTGTCAAGAGATGGAGGACCAGCATAATGCCCCAATAGTTCCAAAGATATTGGACATTTTATATGCAACTGAGGACGGATTCGCGCCACCGGAGGAGGAAGACGGTGACAACCCTCACCCGCCTGAAGAGGACGAATATTGA

Protein sequence:

>DPOGS208326-PA
MAVNVYSTNVTSENLSRHDMLAWVNDCLQSNFAKIEELCTGAAYCQFMDMLFPGSVPMKRIKFKTNLEHEYIQNFKILQAAFKKMSVDKVIPVDKLIKGRFQDNFEFLQWFKKFFDANYDGREYDAFDARGGLTIGSGACESGVPLCVSAAPPRIVPIDRLVKGRFQDNFEFLQWFKKFFDANYGGTEYDAMAQREGLPMGHGAPGAPPRAVPAAVKKPTAPVAKVAARPQTIGKANSTVRSPPMNPSRLTQSGKGDSKVIDDLNHQINELKATVDGLEKERDFYFGKLRDIEVICQEMEDQHNAPIVPKILDILYATEDGFAPPEEEDGDNPHPPEEDEY-
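
Consistency check (illustrative sketch):

The lengths reported for this record fit a coding sequence with one terminal stop codon: 3 x (341 + 1) = 1026. The Python sketch below shows one way such a check could be run on a transcript/protein FASTA pair. The helper names (parse_fasta, residue_count, cds_matches_protein) are hypothetical and not part of any database tooling, and the sketch assumes that a trailing "-" or "*" in the protein marks the stop codon, as the trailing "-" above appears to do.

```python
def parse_fasta(text):
    """Return {header: sequence} from FASTA-formatted text."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def residue_count(protein):
    """Count residues, ignoring a trailing stop marker (assumed to be '-' or '*')."""
    return len(protein.rstrip("-*"))


def cds_matches_protein(cds, protein):
    """True if the CDS length is three nucleotides per residue plus one stop codon."""
    return len(cds) == 3 * (residue_count(protein) + 1)


# Toy record (not the monarch sequence): ATG GCA GTG TGA encodes M, A, V, stop.
toy = parse_fasta(">toy-TA\nATGGCAGTGTGA\n>toy-PA\nMAV-\n")
print(cds_matches_protein(toy["toy-TA"], toy["toy-PA"]))  # prints True
```

To apply it to this record, paste the two FASTA entries above into a single string and pass the DPOGS208326-TA and DPOGS208326-PA sequences to cds_matches_protein. The check covers lengths only; it does not translate the nucleotides.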