Monarch geneset OGS2.0

DPOGS205137
TranscriptDPOGS205137-TA1614 bp
ProteinDPOGS205137-PA537 aa
Genomic positionDPSCF300246 - 91561-98230
RNAseq coverage568x (Rank: top 22%)
Annotation
HeliconiusHMEL0026050.057.17% 
BombyxBGIBMGA008181-TA3e-11651.27% 
DrosophilaCG11474-PC2e-5831.13% 
EBI UniRef50UniRef50_D6WET21e-7937.26%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WET2_TRICA
NCBI RefSeqXP_971624.22e-7134.77%PREDICTED: similar to CG11474 CG11474-PA [Tribolium castaneum]
NCBI nr blastpgi|3838522948e-7235.56%PREDICTED: UPF0364 protein C6orf211 homolog [Megachile rotundata]
NCBI nr blastxgi|3838522943e-7035.36%PREDICTED: UPF0364 protein C6orf211 homolog [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[319-486] IPR0027913.5e-39Domain of unknown function DUF89
Orthology groupMCL12103 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205137-TA
ATGAGTAAAGTATCTGTAAAATCTCCTCCAACCTCAAAAAGTTCATTAGAAAAGAGTGCAACTAGTTTAAATGAAAATGACGATCCTTCTACGACACCCGATTTTATTTACGGGGGTCCATTTATTAAAGTTCAAAGGCCGGAGTTTATGGATATAACGACACCTATAAACGTACAGCTCCAGGGAACATACAAAAGGAGTTTTGCATATCATTCCTTAAAAGAACGGTTTCCGGTCATCCTAACCAAAATTATAGACTATCTTTCGCGAGAGGGCGGTAAAGTTAAATCTGCGAATAATGCCTCGGACGAGGACATTCGTAATCTGATAGAGTATGTTGTGAAATTAAAAAATGATGTAGCAACAAATAAGAAGTACGACCTCCTCACTGTGGATACACCGGAAGCCAAGAAGTGGAACGAGTGGATACTGAGTGTGGACAGTCCGTATTACTTCACCAACACATGGGTCTTTTCTGAATGTTATGTTTACAGGCGACTGAGAGAGGGCTGCGAATTAACCAAAGGCTTAGCAAACTTTGATCCGTTCGAAGATCAAAAAATTAAATCCTTCACCGGGTCCCTGGAACCGATGTGTGTTGTAGCAGAGAAGGTTATATCTATGATGGATCCTTCAGACAAAGATAAGCGGAAAGCTGACTTTATAACTTTACTGAAGTTGTGTCTTTGGTCTAATAAGTGTGATCTGTCCTTGTCTATGGGTGAACAAGTGACGTTAGGTAACAAGGATATTGATGCAAGTCCCTCTCAAACACTCATCGACCCCATACAGATGATTGTGGATTACAAGGACAAGGTGCTAGTGGATGACTCTAATAAAATCAGTGATCAAGTTTTCACTAAGGCTGAAAACATAGCTAAGGCTATAGAGGCGAACACGACCTTCAAAGCTAAGTGCAGTTGTCAGCGTCTGGCCGGAGCTCCTCCTCCGCCGGTGAGATTCCATGTTAAGAAGATACCCTGGTTCGTGTCTGACGTCACTCCCAAGGATTTTAATTTTGTCATCAACCAGTGCTGTGCCGCGACTTTCAACAAGGTGGTTAAGCTGCCGGCCGAGGGAGATCGCGCCGAGGGCGAGTCCGAAGCACCTCCGACCAGGACGGTGGCCGACGAAGCCCTGAAGACACTCGGGTCCCGATGGGCCAAGTACGTGGACAGTGGCGCGTTCGTAGTTATGTGCGACGATTTCTGGACATCGCCCCACGTGTATAAAGACATGAAGAAATACGAACCGAATCTGTACCGGAAGTTGCAATTCGCGGCCGCCATCTTGTTCAAAGGAGATCTTAACTACCGGAAACTGTTGGGAGAAAGAAACTGCAATCCATGTATAGGATTTGAACCCGCCTTGCAAGGCTTCATCCCGGCTCCTATCATAGCTGTGCGGACGGTGAAGGCGGACCTCATCTGCGGCTTACCGAAGGGCAAGTGGGAACAACTCACCAGGATCGATGACCGCTGGATGGAGACGGGCAACTACGGGGTAATCCAGTTCTGTGCCAAGGCCGAGGCCCTGAAGGTGTCTGACAAACCTTGCGAACAGTACGGAGACACTTGCAAAGATGATGCCTGCCTCGACCTGGACATAGTGTGA

Protein sequence:

>DPOGS205137-PA
MSKVSVKSPPTSKSSLEKSATSLNENDDPSTTPDFIYGGPFIKVQRPEFMDITTPINVQLQGTYKRSFAYHSLKERFPVILTKIIDYLSREGGKVKSANNASDEDIRNLIEYVVKLKNDVATNKKYDLLTVDTPEAKKWNEWILSVDSPYYFTNTWVFSECYVYRRLREGCELTKGLANFDPFEDQKIKSFTGSLEPMCVVAEKVISMMDPSDKDKRKADFITLLKLCLWSNKCDLSLSMGEQVTLGNKDIDASPSQTLIDPIQMIVDYKDKVLVDDSNKISDQVFTKAENIAKAIEANTTFKAKCSCQRLAGAPPPPVRFHVKKIPWFVSDVTPKDFNFVINQCCAATFNKVVKLPAEGDRAEGESEAPPTRTVADEALKTLGSRWAKYVDSGAFVVMCDDFWTSPHVYKDMKKYEPNLYRKLQFAAAILFKGDLNYRKLLGERNCNPCIGFEPALQGFIPAPIIAVRTVKADLICGLPKGKWEQLTRIDDRWMETGNYGVIQFCAKAEALKVSDKPCEQYGDTCKDDACLDLDIV-