Monarch geneset OGS2.0

DPOGS200131
TranscriptDPOGS200131-TA1653 bp
ProteinDPOGS200131-PA550 aa
Genomic positionDPSCF300128 - 687607-692125
RNAseq coverage91x (Rank: top 62%)
Annotation
HeliconiusHMEL0094463e-16176.23% 
BombyxBGIBMGA002776-TA0.069.31% 
Drosophilahd-PA3e-8435.34% 
EBI UniRef50UniRef50_E1ZV752e-12143.26%Protein downstream neighbor of son-like protein n=7 Tax=Formicidae RepID=E1ZV75_CAMFO
NCBI RefSeqXP_393677.21e-13643.94%PREDICTED: similar to Downstream of son gene protein homolog [Apis mellifera]
NCBI nr blastpgi|3838511153e-13646.59%PREDICTED: protein downstream neighbor of son homolog [Megachile rotundata]
NCBI nr blastxgi|3838511158e-13446.59%PREDICTED: protein downstream neighbor of son homolog [Megachile rotundata]
Group
KEGG pathway 
Orthology groupMCL15373 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200131-TA
ATGCTACTCCACAAGTTGAAAAAACGAAAAAAAGCATTGCAGGAGCGCATGCGACATCCCTCGAGTTCAGTAGCAGAAGAAGCATCTAGTCAACAACCTATTAATTTAGATTTCTCCAGAATCCTTAATGGAGACAAGAGAAAAAATCCTTTCTCAAAAAATAATTCAGAGCAAAATAAGAAAGTTAAACTCCAATCAGCACCAGTAGATGAATCAAATGATCACACTTTATTTGCACTTCTAAAGTTACCAGCAAAGACAGAAAAACCTCCGCAAGATGTTGATACAGAAAAATTATCAACATTTTCAAATCTCTTACAGAAGTTCACAGCCGAACATACAGTAGTCACTAAAGTGAAGGAAAACAAATATAAACACTTAACAATAGACTGGGCCCTTAAAACCAAGTTAAGGCTCATGTCAACAAAACCATTCCCCTGGACATCCAAACTGAAGGCCAGTGAAGAAGCATCAGGAATAACAGGATTTGTCCGATGCCTTGATACAACATCTTCATCTCTGGATACATCACCCCGGGCACGGTTTCACCAAACCTGCCTGTACTGGCAACATCCTCACCTGCCATGGGTGTCACTGTACCCACGTTCCTCGGGCAGGGTTGCCGCAACCAGTTTCATGGCCACCAATGAGGAAGTCAAGCGAGGTCTCATGACGGAATGGACGGAGAGTTTTAGGTCGTTATTTCAACTAGTCCGAGCGTTACACTGCCCATACTTCTACGTTTTATCGAATACGTTTAGCGTCTTGTTCATAGCGAGCGGTGTGTGCGGCGCCGCGGAGCCCCGGGCTCTAGTGGCGCCTACCACGCGAGGGCTCAGGCATGCTCTCAGACAAGAAGATGTTGAATTTACCATGCCTCTGAGGCCTGAAAACAAGAAGAAGCTCAACACCTCTGATGAAGAACGAGGGAAAAACTCTTCATTTGACAGTTGCTACGACACCATGGACGATGGACGGACCAACGACCAGAACTGCTCGGGAGACGAAGATGACCCTGACGAGTTCCTCTCACAAATGGGCCTTGAGACGGCGGAGCTTAGGAAGATCAATAACGCACAGGCGCGCGTAAGCCACACGGCGGAGAGCAGTGTGGATCGTTCAGCGGAGTCGCTGGTTGTGGTGAGCGGTGCTGACGCTCAAGCTCTCTTCAATTTCCTCCTGAACTGCAAATCACTGGTCGCTGGCAGCGGACCTTTAGCTGGTGTCCCGCCCACGTTACTCGCGCCCACTGCCTTCCATGGAGGAACATTACAGGCTCTTAAGGTCAAAGAAAATATAATAAATTCTGAAAATAATAGGTACTACTCTATAGAACTACGAGGACCAATTCTACCAACTACCGTCCATACATTATTTAATGTACTGAAAAATAGTACTACCTCGCAATTTAGCGCTACATTTGCACATCTCCAGCCAACGTTGGCCTTTACTTGGGCGGCGAGTAAATTAGCAGAGGAATCCTCCAAAGACAACGAACAGAACCAATTCACAAAGGCATTCAACAAAGAGAACTTATCTGATTGCGGTATAAGCGACGAAATGTTACAACTTTTCTGTTCATCAGATCCCGGTTCTATTAAATCTGTCGACAGCGTTAAATACAACTCGGAAGACGATACATACACCTGGTGA

Protein sequence:

>DPOGS200131-PA
MLLHKLKKRKKALQERMRHPSSSVAEEASSQQPINLDFSRILNGDKRKNPFSKNNSEQNKKVKLQSAPVDESNDHTLFALLKLPAKTEKPPQDVDTEKLSTFSNLLQKFTAEHTVVTKVKENKYKHLTIDWALKTKLRLMSTKPFPWTSKLKASEEASGITGFVRCLDTTSSSLDTSPRARFHQTCLYWQHPHLPWVSLYPRSSGRVAATSFMATNEEVKRGLMTEWTESFRSLFQLVRALHCPYFYVLSNTFSVLFIASGVCGAAEPRALVAPTTRGLRHALRQEDVEFTMPLRPENKKKLNTSDEERGKNSSFDSCYDTMDDGRTNDQNCSGDEDDPDEFLSQMGLETAELRKINNAQARVSHTAESSVDRSAESLVVVSGADAQALFNFLLNCKSLVAGSGPLAGVPPTLLAPTAFHGGTLQALKVKENIINSENNRYYSIELRGPILPTTVHTLFNVLKNSTTSQFSATFAHLQPTLAFTWAASKLAEESSKDNEQNQFTKAFNKENLSDCGISDEMLQLFCSSDPGSIKSVDSVKYNSEDDTYTW-