Monarch geneset OGS2.0

DPOGS208962
Transcript: DPOGS208962-TA (1491 bp)
Protein: DPOGS208962-PA (496 aa)
Genomic position: DPSCF300009 + 848763-857254
RNAseq coverage: 623x (Rank: top 21%)
Annotation
Heliconius: (none listed)
Bombyx: BGIBMGA002435-TA, E-value 3e-174, identity 70.04%
Drosophila: CG42268-PD, E-value 1e-07, identity 33.33%
EBI UniRef50: UniRef50_E9IBS4, E-value 4e-14, identity 36.69%, Putative uncharacterized protein (Fragment) n=3 Tax=Formicidae RepID=E9IBS4_SOLIN
NCBI RefSeq: XP_001813542.1, E-value 1e-14, identity 31.03%, PREDICTED: similar to AGAP005803-PA [Tribolium castaneum]
NCBI nr blastp: gi|383863262, E-value 5e-16, identity 35.64%, PREDICTED: uncharacterized protein LOC100878688 [Megachile rotundata]
NCBI nr blastx: gi|383863262, E-value 6e-16, identity 36.41%, PREDICTED: uncharacterized protein LOC100878688 [Megachile rotundata]
Group
KEGG pathway: (none listed)
Orthology group: MCL44363 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208962-TA
ATGCGTCAAAGTGCCACGTCGGGGCACCAAACCCTGGGTGCTAGGCGTCAGAAGACGGACCTCCATGTAGCTGAGCCCACGAGGTCCGCCAGCCCAGCCCCCAGTCTCCGCAGCCAAAGGGCCAAACAGTACAGGGCCAGTCTTCCAGTACGCACGGCTCGTTTGTTGCAACGGTCACGTACTCAGACACCATCAGATGCCGGGATATCAGACGGGAGCACATCCCCTGCACCAGCGCCCTCATCATTACTTCAAGATGTCGTCGACATCAAAACCCTACTGTTGCAACTCAAGAGGGTATTACAAGAGAGCGAAACTCTCGACCCGTTGCTAGCGGCGTGCGCGGAGAGTCCCGCGCGCGGCAACGGGCGCCGCCTCCCCGCCTCGCCGCTGCACCCTGATGCTTGTGCCGAGATGCGTCGCCAAATAGTCTACCTACAGGGTCAGTTGGAGGAGAGAGATCGATTGGTGCGTGTGTTGCAACAACAGATGCTGCGTATGGCTGAGAGCCATGAGCCGAGAGTTGATGACACCTGCAACGTCGCCACACAGACCGATCGATTGCGGCCACCTATCGGCAGCTCGCTAGCGAGTTCTGAAAATTCCGGTTTAGTGAGGTTATTCGCCGCTTCAATAAATTCAATCAAGTCCATTATTCACATGATGCAAACATCACTCACGTTTTTAAACACAAAATATATATCATATCTGAACGATAAAAATCAATATAATGCTATATGTACAAATACATTCTCAGTTAAAAATTTCACGAAATATTTAAGAAGTCTAGAATGGGCTGCTGTTTTCGGCGACTATATGAATAAAAAAGTCATTTTCTCGTCTACTTCTATTTGGAATGAACAAGCTAGTGGAAGGAAGCTGCCGTATAGTGAAGCACAAAAGACAAAAAGGCCTGGTAGTAATGGGACCGTGAAATGTCAGTGCGGTGCGAGAAAAGTGAACGGACTATTAGACGAAAACAGACAGACGGACAATCTTAATGACAAGCGGGACTCGGTTCTGCTGAAGCCTAACAGTCGGGTTGGATCTAAATACCTTCAAGCATTAAACAACACAGACAATAGAAGATATATCAAACGAGATTCGTCTCTCAACGATAGATATGAACGACGAAGCCTGTACTCGTCACGTTCCAAATCGATAGAGAATTTTCACAACCAGCATTATCTATCCACAGAGCCTTTACTTTCTAAAACAGCAGTTATCGAGGAGACTTCCATAAAACATCACACTTCCTATGGGGATCTTTACAATGGAAGGATAAGTAGCAAAGAGAGTGTAAACGGGAACAGACACAGTCCAAGTCTCGGTCAGTCCACAGACACGGATTATAGTTCGGATGGAGGATATAAATCTTTGCCATCTTCTATTAACTACAGTACGTCACCCAAAAAAGTAAGCGGAATACCGCGGAGGTCTTACGAACAAAAGTTTAATTCACCAAAACAAATCAGGGCGAGCGCCATATAA

Protein sequence:

>DPOGS208962-PA
MRQSATSGHQTLGARRQKTDLHVAEPTRSASPAPSLRSQRAKQYRASLPVRTARLLQRSRTQTPSDAGISDGSTSPAPAPSSLLQDVVDIKTLLLQLKRVLQESETLDPLLAACAESPARGNGRRLPASPLHPDACAEMRRQIVYLQGQLEERDRLVRVLQQQMLRMAESHEPRVDDTCNVATQTDRLRPPIGSSLASSENSGLVRLFAASINSIKSIIHMMQTSLTFLNTKYISYLNDKNQYNAICTNTFSVKNFTKYLRSLEWAAVFGDYMNKKVIFSSTSIWNEQASGRKLPYSEAQKTKRPGSNGTVKCQCGARKVNGLLDENRQTDNLNDKRDSVLLKPNSRVGSKYLQALNNTDNRRYIKRDSSLNDRYERRSLYSSRSKSIENFHNQHYLSTEPLLSKTAVIEETSIKHHTSYGDLYNGRISSKESVNGNRHSPSLGQSTDTDYSSDGGYKSLPSSINYSTSPKKVSGIPRRSYEQKFNSPKQIRASAI-
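
The lengths reported in this record can be cross-checked with simple arithmetic: a coding sequence that runs from the start codon through the stop codon should be three nucleotides per amino acid plus three for the stop. The sketch below is only an illustration. The function name `expected_cds_length` is hypothetical, and it assumes the transcript contains no untranslated regions, which is consistent with the sequence above beginning with ATG and ending with TAA.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Return the nucleotide length implied by a protein length.

    Assumes one codon (3 nt) per amino acid and, optionally,
    one extra codon for the stop signal.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record: 1491 bp transcript, 496 aa protein.
transcript_bp = 1491
protein_aa = 496

# True when the two reported lengths agree under the assumption above.
lengths_agree = expected_cds_length(protein_aa) == transcript_bp
print(lengths_agree)
```

The same check can be applied to other entries of the gene set. A mismatch would suggest either untranslated sequence in the transcript or a partial gene model, not necessarily an error.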