Monarch geneset OGS2.0

DPOGS205047
TranscriptDPOGS205047-TA2043 bp
ProteinDPOGS205047-PA680 aa
Genomic positionDPSCF300074 - 586480-592127
RNAseq coverage579x (Rank: top 22%)
Annotation
HeliconiusHMEL0156127e-7258.60% 
BombyxBGIBMGA006925-TA0.075.88% 
Drosophilamilt-PD8e-7235.23% 
EBI UniRef50UniRef50_E2BK456e-9036.87%Selenide, water dikinase n=7 Tax=Formicidae RepID=E2BK45_HARSA
NCBI RefSeqXP_393589.25e-9338.03%PREDICTED: similar to milton CG13777-PA, isoform A, partial [Apis mellifera]
NCBI nr blastpgi|3071861203e-9536.60%Selenide, water dikinase [Camponotus floridanus]
NCBI nr blastxgi|3071861206e-10337.41%Selenide, water dikinase [Camponotus floridanus]
Group
KEGG pathway 
InterPro domain[25-176] IPR0069335.4e-18HAP1, N-terminal
[271-364] IPR0221547.7e-10Trafficking kinesin-binding protein domain
Orthology groupMCL11418 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205047-TA
ATGGAACATCAGAACTTACGTGAAGTTAACTATATAATATTAACAAACGACACAGAGGATGGTTCGCCAACAGCTGAAGAGCAAGAGGCGGCTACTGCGATCGCTCTGAAGAAACGCACAGGCGCCCTGGAGAGGGAGAACAGGGCGCTCAGGGATGAGGCGGCGCGCCTCGCAGCCGGAGCTGACAGCGCCGAGCTCGCTGAGAGACAACTCCTCAGGGACATCGCCTCACAGCTTTCCAGTGCTAACTCTGAAGCATCGGTGTTAAGTTCGGAGTTAGCTGAAGAACGCCAACGTTCCACTGATCTCCAGCATCAATTGGATTCAACCAGCGCAAGGCTCGCGCTCAGTGAACGAAGCTTACAACAGTTGACCGTAGAACACGAGCACACTATACGAATATTGGAAATCACAAAAGATAATCAAAATGCGCTCGCATCAGAATTGGCAGACGCTAAGGAGCGGTATGCAGAGGTGGCAGCACTCTTGGCGGAGGCACAGGAGCAGCTCCGCGCGGTCCGTCGTCGCGGTGAGAGCACCCGCGGCCTCATGCCCTCCGTGGCCGCCGCTGCCGGTCTCCTACCAGCCAGCCTACATCGTGAGATGCATTCCTCCGTGTACTCCGAGCTCAGTCTCGACTCGGGTATCGGAGACCCGCTCGCACATTCAAGCATGCAGAAGGTGTTTGAAACGGTGCAGTGTGCATCTCGCTGGTCGGGTGCTTCACTGTCTGGCTCCGAGGACGATGTCGCTCCCCGCGTCTTCAAGCCGGTTCCTAAAACCACCTCCGACTCCTTCTTTGAAGATACATCGGACACCGAGTCTGAGGATTTGTACCCCGGTAACGCAGCAGTCGGCGTTCCTGGAGCACCGGGTGCCGCGGAGCTAGCAGCGGCCCTGAGACGTCTCACTCCACAAGAGATCAATTCGCGGCGCGCTTCACTAGCGGCGTCGCATTTGCTACATCAACGACGACGATCTGATAGGGATGCCAGTGAAGAGAGTGCGGTATGGGGAGCAGGTGCGGCAGGCGGGATCGCGAGGTTCCGCGCTCCTCATAAGTTACAGATCGTTAAGCCTATGGAAGGCTCCCTTACGCTGCACACTTGGGCTCAGCTCGCTAAGCCTAATATGTCTGGCTTATTAGAAGAGCAAGAAGGTGTTGGTGTGAGGGGCTCACGATCTGCACAATCGTTAGGCATGAGGGTATACAGGTTATCGGATGTAGAAGAAGATGATGACATGCCGCGGCTACCACACTCCAGTCATATATACACATATACGAACAGCACGGTCTTACATCCGAACGATGGCAGCCTCGTAGGTAGCAGCGTGAGCAGCGTGTGTAGCAGCGGTATGAGCAGTCTATCCAGCAGTGTGTTGGGCAGCGCCTGGAGTTCGCGCCTCACATCACGCCGGTCTTCCGCCGCGGTGTCACCGGTTCACTCTCGCCGAGAGTCGCTATGCGTTCCAGTCCAGCGCTCACACTTCACACCGACGGCCACACCCGCTAACAGCCCCCTGTTAGGCTCGCCGGATTCTTCACCTCCTCCCACACCGCGGCCTGGGGACGCCCCGCCCTCCCTGCATGCTTTGATAGCGAGCGGTACATCAATCCTTCGGCGGCGATACCTAACTTCTCCGAACTCCGACGGTTCACCGGCCCCGCTCGCTCTGCAAACCCCGGGTTCTCTTTATATGGGTCTCGTCCATCGCAGTCCGATGGAACAGCTGACGTGCCTGAAGAGAACGCTTCGATCGCCCGCTGCCGCGCCCTCCGAACGGTCGGATGACGCGGATGCTCCACTGGGGGTGCCCGCTCACCCGGGTGAGGGAGCCCTAGACGTGGCTGCCTCGATGGGGTTGGGATGCGTGACGGGGCCTTCAAACGGAGCGCGGAGGCCACGAGTAAGAGCACCTCGACCCCGTACCGACTTGGGCACGGTGGGAACGGCGCCCGCCGCTAATAATGTACCAGCGCATTCATCTCCCCTGGGCACTCTCAGCACTTTTCTCTTCGGTCGGAAAGGTGGCCTTCTGTGA

Protein sequence:

>DPOGS205047-PA
MEHQNLREVNYIILTNDTEDGSPTAEEQEAATAIALKKRTGALERENRALRDEAARLAAGADSAELAERQLLRDIASQLSSANSEASVLSSELAEERQRSTDLQHQLDSTSARLALSERSLQQLTVEHEHTIRILEITKDNQNALASELADAKERYAEVAALLAEAQEQLRAVRRRGESTRGLMPSVAAAAGLLPASLHREMHSSVYSELSLDSGIGDPLAHSSMQKVFETVQCASRWSGASLSGSEDDVAPRVFKPVPKTTSDSFFEDTSDTESEDLYPGNAAVGVPGAPGAAELAAALRRLTPQEINSRRASLAASHLLHQRRRSDRDASEESAVWGAGAAGGIARFRAPHKLQIVKPMEGSLTLHTWAQLAKPNMSGLLEEQEGVGVRGSRSAQSLGMRVYRLSDVEEDDDMPRLPHSSHIYTYTNSTVLHPNDGSLVGSSVSSVCSSGMSSLSSSVLGSAWSSRLTSRRSSAAVSPVHSRRESLCVPVQRSHFTPTATPANSPLLGSPDSSPPPTPRPGDAPPSLHALIASGTSILRRRYLTSPNSDGSPAPLALQTPGSLYMGLVHRSPMEQLTCLKRTLRSPAAAPSERSDDADAPLGVPAHPGEGALDVAASMGLGCVTGPSNGARRPRVRAPRPRTDLGTVGTAPAANNVPAHSSPLGTLSTFLFGRKGGLL-