Monarch geneset OGS2.0

DPOGS203874
TranscriptDPOGS203874-TA1374 bp
ProteinDPOGS203874-PA457 aa
Genomic positionDPSCF300402 - 99049-105872
RNAseq coverage1x (Rank: top 93%)
Annotation
Heliconius% 
BombyxBGIBMGA014462-TA2e-0835.79% 
Drosophila% 
EBI UniRef50UniRef50_E5S5M24e-1030.77%PiggyBac transposable element-derived protein 1 n=2 Tax=Trichinella spiralis RepID=E5S5M2_TRISP
NCBI RefSeqXP_001942731.17e-0835.37%PREDICTED: similar to hCG32740 [Acyrthosiphon pisum]
NCBI nr blastpgi|3392378511e-0930.77%piggyBac transposable element-derived protein 1 [Trichinella spiralis]
NCBI nr blastxgi|3392378511e-1028.74%piggyBac transposable element-derived protein 1 [Trichinella spiralis]
Group
KEGG pathway 
Orthology groupMCL29161 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203874-TA
ATGTCGGCAAAACCGAATGGAGGACAACAACGGCGTAGGGCTGAAAGCCCTCCTCAGTCAAACGTTATAAGCCAACAGTTGGAAACAATTCTGAATCGTCTGACGGCATTGGAGCAACAATTGACATCTGTGGAGCACGGGCCAATGTCGCTGCGTGCTTCAACACTGGAGACGAGCTTTGAGACGGGGGACGTGCGGCGCGATTTGCGCGAGGTACCCGCACCTACCGAGAGACCACTACTGTCTCCGGGTTCCACGGCGGCCACAGAGGTTGCTGAGAAGTTTTTAGAAGCGATAACCTCCTTAACCACGGAAGGTGATGACGAGGAAATGGTTACCGCCTTTTTGCCGCGGGATGTGCCTGGCAATATTGAGGATTTTAGAGTTCACGAGGACAACATTCAAAGTGACGATAGCAGCGACGAGGAAGCATTGGCTGAAAAAGCAAACAGAAGACGAAAGCAGCTATGTCGGCCTGTATGGCGTAAATGTTCCCCTACGTACTCTTCTACGACTGAAGAAAGAACTAATGTGCAAGAAAGACAGGAAGCAGTAAATGAACAGCTCGGAGAACTTAGTCCCGTACAAATATTTGAAAAAATGTTAGATGAAGAAGTTACAACCCTGATAATTACTAACACAATTGAATATGCTAATCAAAATAACAGACATACTTTTCAACTAGACTTCATTGATTTGAAAAAGTTTATAGGTATATTGATACTGTCTGGATATCATAAACTACCCAGGGAGGATTTGTACTGGTCTTATGATGAAGATGTCGGTGTTGAAATGTCCCGCTCCGTCCGTAAGCGTCAGTTGAGGATCTCAGCGGGAGGTAGCTCGTCGGGCAGAATCCCGAAAGTTGCGGCCACCACTTCAGCCAAGCGTGGCAGTGGACAGCCCCCTATTACGAGGATGAACGTCGGCTTGACCAAAGCGAAGGAGGCTGTAGACCGGCAGAAGCGCGATTCGTTCCTGCTGGATTCACAAGAAGAGATCACTAGGGAACAAATGGTCTCAAATTGTAACGAATTGGTACCCTATACACAAAAAAGGCGCAAAGGCAAGCCGGTCAAACGCTGGAGTGATGATATAGTTGCCACGGATGGAATAACTTGGGCAGGACTAGCCAGGAATAGAGACACTAGGAGAGAAATGGAGGAGGCTTTCACCGCCAGAGCATCTGATTACAGTGCAGGCGAGAGGGATGAAGCTGTATTTGGGAACTCCCTCCCTTGGGAAATCGGCATCTACAGTGTTCAGAGACCAATATTCCAAAATCTAAAGAATCAAACAATACTGTTTTCGGTGGCCGACTTTCTTTTAAAACAATTAAGCATTGTTGTAAAAGAAAAAGACCCTCCCGGGTGA

Protein sequence:

>DPOGS203874-PA
MSAKPNGGQQRRRAESPPQSNVISQQLETILNRLTALEQQLTSVEHGPMSLRASTLETSFETGDVRRDLREVPAPTERPLLSPGSTAATEVAEKFLEAITSLTTEGDDEEMVTAFLPRDVPGNIEDFRVHEDNIQSDDSSDEEALAEKANRRRKQLCRPVWRKCSPTYSSTTEERTNVQERQEAVNEQLGELSPVQIFEKMLDEEVTTLIITNTIEYANQNNRHTFQLDFIDLKKFIGILILSGYHKLPREDLYWSYDEDVGVEMSRSVRKRQLRISAGGSSSGRIPKVAATTSAKRGSGQPPITRMNVGLTKAKEAVDRQKRDSFLLDSQEEITREQMVSNCNELVPYTQKRRKGKPVKRWSDDIVATDGITWAGLARNRDTRREMEEAFTARASDYSAGERDEAVFGNSLPWEIGIYSVQRPIFQNLKNQTILFSVADFLLKQLSIVVKEKDPPG-