Monarch geneset OGS2.0

DPOGS214219
TranscriptDPOGS214219-TA1326 bp
ProteinDPOGS214219-PA441 aa
Genomic positionDPSCF300014 + 783616-785428
RNAseq coverage28x (Rank: top 76%)
Annotation
HeliconiusHMEL0164151e-17871.11% 
BombyxBGIBMGA005945-TA4e-13660.14% 
DrosophilaCG3880-PA7e-3350.00% 
EBI UniRef50UniRef50_D7EJ901e-9748.74%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EJ90_TRICA
NCBI RefSeqXP_396790.24e-9345.88%PREDICTED: similar to CG3880-PA [Apis mellifera]
NCBI nr blastpgi|2700161613e-9748.74%hypothetical protein TcasGA2_TC006850 [Tribolium castaneum]
NCBI nr blastxgi|2700161613e-9548.52%hypothetical protein TcasGA2_TC006850 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL13598 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214219-TA
ATGATGCACTATAGACCTACCAGTTTGGAATCTTTGCCGAAGCAATTTGTTTCAGCAATGAGAACCCTTTTTGATATCATGGATGATAAACAAACTGGTTATGTTAAATTAACAGATATCGAAAATCGATGGCGTGATGACCGTACCAAAGGTCTTCCTAGAGGTGTTATTGAAAGTCTTCAAAAGGTAGCATCCCATGATGGCTTACTAACATTTGAAAGTTTTTGCACAGGTTTAAAAATATGCCTATTACAAAATCATGCACAAAACGCAAAGGTTAACGCTATCAATGCGGATGTCGATAATTCTGTTATTCGCAATCAAATATCTCAGATGAGCCGACCACCTTCGGCACCTTTAATTGATATAGATAATGATAATGACAGAGATACGAATTGGAATTTAATGAGAGTTCCTACGAATTCCAATCAACGTAGAACAAACAGTATGCCCCAATTAATAGAAGACAAAGATTATTTGAATAAAGAAAAGGATCCCATACCAAAATTGCTTCAAAATGACAATGAAAAATCTGTTTCTAAGCCAAATGCGTCTCAGGGTCTTAGTGTTTACGGACCTCCGAAACCTCCTCGATTGGAGAAAAATGCGAGTGATAGGAAAGATGTTTCCAGAATAGTTTATAAAACAGATGATATGTTCTTAGAGAGTAATAATATGGGCAATAGTAAGTTGGATTTCCAAGATTGTATGTCTCAGTCATTATTAGACGGACAAGGTCGGGGTCTCGGCGATGGTCGTTCAGCTATTGGAACGCAGACGACTTTACGAAAAGCGTCACGGCGAAGGGAGCCCCGTCGCCACACCTTACACAACGGTGTTGATTATAATCTTATAAAAAGGATGAAACAAATTGAACAAGAAAAAGATGTATTGTTGCAAGGCCTCAGTGCAGTAGAAGAAGCTAGAGAATGGTATCTCAAACAACTAGCCGAAGTTCAAGACAAAATGCGATATGCTGGCAGGATGGGAGCTTACATTGAACCTTGGAATGAAGCCCACCAAGAACGAATTGAACTGTTAAAGGCAAGAGTATTGGAATTAAACCGCCAGCTGGGTGCTTTAGCAGCAGGGTGGCGTCATGGGCCTTTATCCCTACACATGAATCTCGCCCTACCCACAACAATGAACACAATGACCACTGCACCAAATAATGCAAATGTTCTACGATCTCAAAATAGAATGCTAGCAGAGGAGGTGAATCGCAAAAATGAAAGGATTACTGTTTTAGAGAGAGAAAAATCAGCCCTAATTAGAGAACTATTGCTGCAATGTAACAGAACAAAACTTATGGAGAGTTTTTCATAG

Protein sequence:

>DPOGS214219-PA
MMHYRPTSLESLPKQFVSAMRTLFDIMDDKQTGYVKLTDIENRWRDDRTKGLPRGVIESLQKVASHDGLLTFESFCTGLKICLLQNHAQNAKVNAINADVDNSVIRNQISQMSRPPSAPLIDIDNDNDRDTNWNLMRVPTNSNQRRTNSMPQLIEDKDYLNKEKDPIPKLLQNDNEKSVSKPNASQGLSVYGPPKPPRLEKNASDRKDVSRIVYKTDDMFLESNNMGNSKLDFQDCMSQSLLDGQGRGLGDGRSAIGTQTTLRKASRRREPRRHTLHNGVDYNLIKRMKQIEQEKDVLLQGLSAVEEAREWYLKQLAEVQDKMRYAGRMGAYIEPWNEAHQERIELLKARVLELNRQLGALAAGWRHGPLSLHMNLALPTTMNTMTTAPNNANVLRSQNRMLAEEVNRKNERITVLEREKSALIRELLLQCNRTKLMESFS-