Monarch geneset OGS2.0

DPOGS208766
TranscriptDPOGS208766-TA1260 bp
ProteinDPOGS208766-PA419 aa
Genomic positionDPSCF300036 - 932777-937038
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0151321e-9288.02% 
BombyxBGIBMGA014395-TA1e-11091.98% 
DrosophilaCG9839-PA3e-1123.89% 
EBI UniRef50UniRef50_E2C9Y46e-4043.89%PiggyBac transposable element-derived protein 4 (Fragment) n=1 Tax=Harpegnathos saltator RepID=E2C9Y4_HARSA
NCBI RefSeqXP_001943184.16e-4543.72%PREDICTED: similar to AGAP012447-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|2211050597e-4143.38%PREDICTED: similar to piggyBac transposase Uribo1 [Hydra magnipapillata]
NCBI nr blastxgi|2211050597e-4142.73%PREDICTED: similar to piggyBac transposase Uribo1 [Hydra magnipapillata]
Group
KEGG pathway 
Orthology groupMCL22652 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208766-TA
ATGTCTGCTAGAAGACATGAGCAATTATTGCGTTGTCTTTACGCTTCTGAAGTTGATGCTAAGGGATCAAGAAAAATAACTCCGTTTATTGACACAGCTACAGCACATTTTCAAAAGATTTACAATGCTGGTAAAAAACTTTCTTTAGACGAATCTTTACTTCTATTCCGAGGGCGATTGCAATTTCGTCAATATATTAAATCTAAAAAGGCTCGTTACGGTATCAAGTTCTATGAGCTCACAACGTCGGATGGATATGTCTTGAATATTTACATGTATTGTGGTAAAGAAAATCTAGAAGAAAACATTTCAAAGACTGAAGACCTAGTTTTGAGGCTAATGCGACCTTACCTTTTGAAGGGACACCATTTATATATGGACAACTTTTATAACTCTGTTAGCTTGTGCAAGAAATTACTCGGTTTGAGAACTCACACTGTTGGAACGTTGCGGTCAAATAGGAAAGGTAATCCTGACAGTATTGTAAAGAAAAAATTAAAGAAAGGTGAACACGTTTGGGTCCGAAAGAACAATGTTTACTTATCTAAATGGAGGGATAGTCGCCCTGTCCTGATGATTTCTACAAACACTCATCCTACAATGGTTGAAGTACAAAACAGATTTGGGAGAAAGAAGATGAAGCCTCTTGAGATGCCATACTTGAATCAAAAGCAAATAGAAGAGTATTTAGCTCTAATAAATGATGGAGAGATTTCGGAAGATGATCTTGAGGACTCGGATACAGAAGGTAATGAGAATTTTTATGAAAATGGAGACGATTTGCTTCACGATCTTGAGAGTCCATTAGAAGAGGAAAATGATGACCCCGATAATGATCCATTCATGGCAGGAGACCCGCCATTGATAAATGAAGCTACAGCAAATGATCAAGTGCCACAGGTTCCATTCCCCTCAACAAGTCAAGCTAGTCGCCGAGCAACTATGAGAGGACTCGTATGGAAGGTAAAAAAAATTGTTTTGAATTCTGACCAGACAGCTTTTCATGGAGACACTACATACCCATCTGAACTGAAGGATGAGGCTACTACTGAAATAGTTCAAAGGATTGAGGAAGAGTCTAATTTGTATTCCGTTCAAAAAAATGTGACTAAACCATTAAATTTATCTGAAACTGAGCTAAGAAAGTTTATGGCGATACTGATATACATGTCTTTTATAAAATATCCCAATGTACGGCTATACTGGTCAAATACTGTAGGTTTTCAACCAATCAAGGACATAATGACTCAACCGATTTGA

Protein sequence:

>DPOGS208766-PA
MSARRHEQLLRCLYASEVDAKGSRKITPFIDTATAHFQKIYNAGKKLSLDESLLLFRGRLQFRQYIKSKKARYGIKFYELTTSDGYVLNIYMYCGKENLEENISKTEDLVLRLMRPYLLKGHHLYMDNFYNSVSLCKKLLGLRTHTVGTLRSNRKGNPDSIVKKKLKKGEHVWVRKNNVYLSKWRDSRPVLMISTNTHPTMVEVQNRFGRKKMKPLEMPYLNQKQIEEYLALINDGEISEDDLEDSDTEGNENFYENGDDLLHDLESPLEEENDDPDNDPFMAGDPPLINEATANDQVPQVPFPSTSQASRRATMRGLVWKVKKIVLNSDQTAFHGDTTYPSELKDEATTEIVQRIEEESNLYSVQKNVTKPLNLSETELRKFMAILIYMSFIKYPNVRLYWSNTVGFQPIKDIMTQPI-