Monarch geneset OGS2.0

DPOGS210334
TranscriptDPOGS210334-TA1710 bp
ProteinDPOGS210334-PA569 aa
Genomic positionDPSCF300025 - 468998-470932
RNAseq coverage171x (Rank: top 50%)
Annotation
HeliconiusHMEL0138320.065.64% 
BombyxBGIBMGA011975-TA2e-17056.07% 
DrosophilaCG9300-PA2e-3327.38% 
EBI UniRef50UniRef50_D6WGP41e-7030.17%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WGP4_TRICA
NCBI RefSeqXP_001608202.17e-7329.36%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|1565380061e-7129.36%PREDICTED: nucleolar protein 11-like [Nasonia vitripennis]
NCBI nr blastxgi|1565380061e-7328.69%PREDICTED: nucleolar protein 11-like [Nasonia vitripennis]
Group
KEGG pathway 
Orthology groupMCL14698 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210334-TA
ATGGCAAAATTGCACAACTATTACGTTTTGTGTCCATTGATCGATCAAAACAGTTTTCTAGGAGTATCACAAGATAAAGATGATGAAAATGTTATCGTTACACTAGGCAGAAACGTTGTTAATAAATACAGGCTTTCAGATCAAAAACAAATTGGAGGCTGGACTTCAAAGGAACATCTCACGTCTTATGTTATTTATGATAAGGAGCAAGAAGGATATGTCGGAGTTTTTAATAAAAACACTATAAAGATATGGAAAGAAGATTCAGACAATTTAGACAAAACAAAAAAACACAAATTTTCAGTCAACATCTTAAAGCTTCAACAGAAAGGTGATAATACAATAATTATATTTGAAAATGGTTATTGTGCGTCCCTTTCCTATGCATTGGAAAATAGAAAAACATATGAAGGGAAACCTCTTATAAAAGATGCCGAAACTGTTGTGGATTCAGCTTGTTTTACATTAGATAAAACAGATTATATATGCTACGTCATCAAGAATACTAGCAACAATTATGAAATTCTCACGAGTCCACTAAGAGAAGAACTGGGTGACATGGACAAGTCCAAAATATGTAAAGTTAAAGTTACGAGGCCCCATGACGTGTATGTTGTTGGGAAACTTATTAATATAGATGAGAGCCCATCCGTTTACATTTTGTGGAGTGACTCTAAGATGTCAGTATACAATCTTGTGAGTAAATCATGGACAAACATTGGCACTGTACCCTGGATATCAACACTAACAAGCGTCTCTCTGGCTTGGATGGGGAAGGATCATCTCATTCTCTTCGGAAGCAACACCGATCAAGATGGAGCAATCATTGTGGCTTACAATGTCATATTGGGTGTGGGATCTTGCAGGTATCCCATGAAAATGTATGCTGAAAATGCTCACTTATATTGCTTCAATGGACGCATAATTCTGGAAGCATCCAATCACATCGGAATGTTGCCTTACATCCTTGAAACAAACAGAAACTTGTCAAGCCTTTTGGGTTCTCACGACACAACTGAAGACAGCTGCATTGAAGTAGCTGAGTGGGGCATAAAATCAAATCCTCTGTTTGCTGAGAGAGAAGAAATAAAAGACTTACTCAAAGTCGGCGTCACGGAACGTAACATGTGCTCACAAGTTATACTGCCTTTATTAGAAGAGAAGGATTTCAGACATGTGTACAATGTTGTTAGAGAATTCAAGGATGTTCCTGAATCAGTCCTAGTTTCAATACTTAACTATACAATTGAAATTTTAAATGCAAAGGAGATAGATGTTAATGATCATGAAGAATTCATGAAATTTTGTGATTGTGAAATTTTAGATTACTTGTTTGAAATAACCTTCAGTGACGCTCTGTTAATACCTTACTTGAGAAATGGACTTACACTAGATAACGCCTTATTTCTACTTTCATATATATCGTACCTGCTCACGGATTCTCATAAAGAATACAGTGATGTCTATGAGAGCAAGTTATTTGATTGGTGCACTTTGCTCATAGATGCTTTTTATCAACAGTATCTATTGACTAAAGATGACAAAGTTGTACAGGTTTTGAACAATGTGCAACGAGTGGTAGTCAATCTCATCGATCAACTCATGACAGTTGATAATGTTTTACCGATGCTACATAAAATTCTATCAGGAAAACCTCAAGTTGATCATGAAGAATCCTTGTCGTATACAATTGAGCTAATGGATATATAA

Protein sequence:

>DPOGS210334-PA
MAKLHNYYVLCPLIDQNSFLGVSQDKDDENVIVTLGRNVVNKYRLSDQKQIGGWTSKEHLTSYVIYDKEQEGYVGVFNKNTIKIWKEDSDNLDKTKKHKFSVNILKLQQKGDNTIIIFENGYCASLSYALENRKTYEGKPLIKDAETVVDSACFTLDKTDYICYVIKNTSNNYEILTSPLREELGDMDKSKICKVKVTRPHDVYVVGKLINIDESPSVYILWSDSKMSVYNLVSKSWTNIGTVPWISTLTSVSLAWMGKDHLILFGSNTDQDGAIIVAYNVILGVGSCRYPMKMYAENAHLYCFNGRIILEASNHIGMLPYILETNRNLSSLLGSHDTTEDSCIEVAEWGIKSNPLFAEREEIKDLLKVGVTERNMCSQVILPLLEEKDFRHVYNVVREFKDVPESVLVSILNYTIEILNAKEIDVNDHEEFMKFCDCEILDYLFEITFSDALLIPYLRNGLTLDNALFLLSYISYLLTDSHKEYSDVYESKLFDWCTLLIDAFYQQYLLTKDDKVVQVLNNVQRVVVNLIDQLMTVDNVLPMLHKILSGKPQVDHEESLSYTIELMDI-