Monarch geneset OGS2.0

DPOGS210392
Transcript: DPOGS210392-TA, 1203 bp
Protein: DPOGS210392-PA, 400 aa
Genomic position: DPSCF300291 - 72307-74780
RNAseq coverage: 280x (Rank: top 39%)
Annotation
Heliconius: HMEL014835, 2e-137, 70.30%
Bombyx: BGIBMGA008241-TA, 2e-132, 70.82%
Drosophila: CG2278-PA, 3e-15, 51.28%
EBI UniRef50: UniRef50_E0VIQ0, 1e-25, 30.68%, Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VIQ0_PEDHC
NCBI RefSeq: XP_975604.1, 2e-39, 50.27%, PREDICTED: similar to mixed lineage kinase [Tribolium castaneum]
NCBI nr blastp: gi|91081193, 3e-38, 50.27%, PREDICTED: similar to mixed lineage kinase [Tribolium castaneum]
NCBI nr blastx: gi|91081193, 2e-57, 40.24%, PREDICTED: similar to mixed lineage kinase [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL24880 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210392-TA
ATGTTGGTGGTGGATACAATAAAGAGGCTCCGATGTTGTTTTCTCCCTTACGCAGACTTCAGCCCGGACGACTGGGGCCCGGAGTTTCCTTTGAGTACCTCTTCTAGACACACTATACCGATGCCGACACTATACAGCACTGAAGGATACAAATACAAGAAACCTAGTATAATTGAACTAGTTTTATACAATATGGCAGCTCTCCTGGGCGGTGTGGCCGCCGGGTACGATGTAAGAGTGTCCAACGTCAGTCCATTACATCCCACGCTACAACCGCATAAGCCGGAATTAGAGACGGAAGTCATAGGCACCTCGATCTACGAAGGCTTCGGCTTCGCCCACAACACGTACCACGGACCCACGAGACATTTGCGGGCGCCACTTAACGCTATCACCGGCTCACCGATATCTAGTCTTCAAGCGGAGGAGCAGAAGCCGATCCGGTTTACGGACTCCCCTACGCACTACGCTCATCCGTCGTCATCGTCAGGGTACGCGCACTCCGCGCAGCCTCTCTCCGGGACCTCGGGCCTGGGCACCAACTACACCTCCACTCAACCCACGCCCTCGCCTCGCAGGACCTCCTCCACCACCTCCGACCACCTCGCAGACCTCTACCACAACTACAGGGAGAACTTCCAGCCGTCGTCCGCATCGCAGGAACGCATACAGGACTACTACTACCGCGCAGAGCCGTACGCGTACGATAGGACCGCGGACTACGTCGTGAAGCACCCCTACTACGGATACGACGAGTGCTCCTTTGAGTACAGAGACTCGCCGGAGTTCCGCCCGCCCTACCTCGCACACAGACGGACACCTTCTAACTCCTCGGCGACGAACTCCCATCCTGAGGAGTTCTCCCCTTCCAGACAGTACCGGGAGAAATACCCCAAGGACAGGAGGGACGAGTACCGGACCCCGCAGAAGGAGTACCCGAGGAAGTCCAGCGACTACTCGCTCCCGCGGGACTACTACCGACCGGAACCGGAGACGGAACGGCCCGCCATCCTCGGCTTGGAGCCGACGAAGCTCCGGTCCAGTCTGAAGAAGTACAAGAAGGGCTCGAGCAGCTCCGGCGGGGGTTCCCCCAACCCCACGCCGCCGGACAGCCTGAGCGAGACCGACAGCTCGTACGCCTCGGCCCGCGAGTCGGCCGGCTCCGCGCGCGTTAGGTTCAGTCCGGACGCGCGGCGCTCGTAG

Protein sequence:

>DPOGS210392-PA
MLVVDTIKRLRCCFLPYADFSPDDWGPEFPLSTSSRHTIPMPTLYSTEGYKYKKPSIIELVLYNMAALLGGVAAGYDVRVSNVSPLHPTLQPHKPELETEVIGTSIYEGFGFAHNTYHGPTRHLRAPLNAITGSPISSLQAEEQKPIRFTDSPTHYAHPSSSSGYAHSAQPLSGTSGLGTNYTSTQPTPSPRRTSSTTSDHLADLYHNYRENFQPSSASQERIQDYYYRAEPYAYDRTADYVVKHPYYGYDECSFEYRDSPEFRPPYLAHRRTPSNSSATNSHPEEFSPSRQYREKYPKDRRDEYRTPQKEYPRKSSDYSLPRDYYRPEPETERPAILGLEPTKLRSSLKKYKKGSSSSGGGSPNPTPPDSLSETDSSYASARESAGSARVRFSPDARRS-
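
The protein record appears to be the direct translation of the transcript: 1203 bp corresponds to 401 codons, that is, 400 residues plus a terminal stop codon written here as "-". A minimal Python sketch of that check follows. It assumes the standard genetic code (NCBI translation table 1); the helper name `translate` and its `stop_symbol` parameter are hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record uses this table; "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are emitted as `stop_symbol`, matching the trailing "-"
    used in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons of DPOGS210392-TA and the first six residues of DPOGS210392-PA.
print(translate("ATGTTGGTGGTGGATACA"))  # MLVVDT
```

Passing the full DPOGS210392-TA sequence to the same function and comparing the result with the DPOGS210392-PA line would extend the check to the whole record.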