Monarch geneset OGS2.0

DPOGS212245
Transcript: DPOGS212245-TA (1374 bp)
Protein: DPOGS212245-PA (457 aa)
Genomic position: DPSCF300488 + 51969-59696
RNAseq coverage: 154x (Rank: top 53%)
Annotation
Heliconius        HMEL007133         6e-111   92.89%
Bombyx            BGIBMGA009658-TA   1e-155   87.07%
Drosophila        Rbcn-3B-PA         2e-90    62.09%
EBI UniRef50      UniRef50_Q16IV4    2e-120   63.09%   Tgf-beta resistance-associated protein trag (Fragment) n=1 Tax=Aedes aegypti RepID=Q16IV4_AEDAE
NCBI RefSeq       XP_970256.2        2e-128   66.58%   PREDICTED: similar to AGAP008003-PA [Tribolium castaneum]
NCBI nr blastp    gi|270009682       1e-128   66.76%   hypothetical protein TcasGA2_TC008973 [Tribolium castaneum]
NCBI nr blastx    gi|270009682       7e-130   67.68%   hypothetical protein TcasGA2_TC008973 [Tribolium castaneum]
Group
KEGG pathway:
Orthology group: MCL10625 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212245-TA
ATGAAACTTGTAAATCAATGTGATCCTAGTGATATTCCTCTTTCTCGTCGTTGGCAACATCACTGTCTGGAGGTGAGAGATGCTGCCCAGGCGCTGTTGTTGGCTGAATTGGGCAGGATGGGACCAAAAGGCCGTAAATCCCTGGTCGATAACTGGGCACAATATCTACCGCTTTACACTCACACGGAGAGCATCAACCCACAAGCAACCCAGAAGGAACCGGCTGAGAAAGTCGCTAAGGACCAAGAAGAGGAATTCGAAGAAGAAGAAGAGGAAATCATCAGGAAGCCATCTTCTATCGCTGAGCTGAAACGGAAACAGACCACAGCGGTTGTTCTGTTGGGTGTCATAGGAGCTGAGTTCGGCCAAGATATCGCCTCGGAAGGTGCCGCTGGAAATAAGAGACGCAAAGACGGTGACAGTAGGAAGAGTTCCATCGTTGAAGGTTTCACACTTAGTTCATCGAACAACCTGGCCCGTCTGACGTCCTTGGCGCTGACTCACTTACTGCTGGCCCCTGGTTCTGCTCGCCTTCCCGCACACACGCCGCTACGACGTGCGGCCATTGACCTCCTCGGGAGAGGATTCGCTGTTTGGGAACCGTATCTCGACGTATCTCATGTGCTGTTGGGTCTGTTGGAGATGTGTTCGGACGCTGATAAATTAGTGCCTTCGATGACGTACGGTCTGCCTCTCACACCGCAGGCGGACTCCTGTCGCACCGCGCGGCACGCACTCACACTTATAGCGACAGCAAGACCAGCGGCGTTCATAACGACTATGGCCCGTGAGGTGGCGAGATGCGCTGCCGCCCCTGCGGGCGCTCCGCCCCCGCCCGCCGCGGTTGCGCTACAGCGAGGCAGGGCGGAGGTCTTACGGGGAATAGAACTCCTCATAGAGAGAATGCACGGCGCTGTGGCTGAACTGCTCGTGGAGGTGATGGATATAATTCTTCACTGCGTGGATCAGTCGCATCTGAAGAGTAAGGGTTTAAGTGAAGTCTTTCCAGCTGTTTGTCGCTACAACCAGGTCTCTCACTGCCCAGCCACCAGGAGGATTGCAGGTAGCCTGAGCGATCAGGCCCTGTCCCTTTTGGGAGACTCCACTTTGCATTACTTTGGATTACACTGTAGGACACTGCAATACCACAGCCGACATCGTCCTTTAAATCATCCGTATCACCAAAAAATACTCCCAGAATCGGTGGCTCTCGAGTTACGTTGGTGGTTGAAAGCAATCGCGAGCACTCTGCCGATACACTTGGGCTCGGTAACGCATCACGCGAAGACTGATGCCTCGGACATCGGCTGGGGAGCGCAGATAGAAGAGACAAAGTTATCAGGCCAGTGGATCGAAGACAAACATGGCATGTGA

Protein sequence:

>DPOGS212245-PA
MKLVNQCDPSDIPLSRRWQHHCLEVRDAAQALLLAELGRMGPKGRKSLVDNWAQYLPLYTHTESINPQATQKEPAEKVAKDQEEEFEEEEEEIIRKPSSIAELKRKQTTAVVLLGVIGAEFGQDIASEGAAGNKRRKDGDSRKSSIVEGFTLSSSNNLARLTSLALTHLLLAPGSARLPAHTPLRRAAIDLLGRGFAVWEPYLDVSHVLLGLLEMCSDADKLVPSMTYGLPLTPQADSCRTARHALTLIATARPAAFITTMAREVARCAAAPAGAPPPPAAVALQRGRAEVLRGIELLIERMHGAVAELLVEVMDIILHCVDQSHLKSKGLSEVFPAVCRYNQVSHCPATRRIAGSLSDQALSLLGDSTLHYFGLHCRTLQYHSRHRPLNHPYHQKILPESVALELRWWLKAIASTLPIHLGSVTHHAKTDASDIGWGAQIEETKLSGQWIEDKHGM-
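
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies and that the trailing "-" in the protein record marks the stop codon. The helper name `translate` and the variable names are illustrative and are not part of the original record. It translates the first ten and last five codons of DPOGS212245-TA and checks that the reported lengths agree (457 residues plus one stop codon, three bases each, gives 1374 bp).

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 15 bases of DPOGS212245-TA, copied from the record above.
first_codons = "ATGAAACTTGTAAATCAATGTGATCCTAGT"
last_codons = "AAACATGGCATGTGA"

print(translate(first_codons))  # MKLVNQCDPS
print(translate(last_codons))   # KHGM-

# Length consistency: 457 aa plus one stop codon, 3 bases per codon.
transcript_bp = 1374
protein_aa = 457
print(transcript_bp == 3 * (protein_aa + 1))  # True
```

Both translated fragments match the start and end of DPOGS212245-PA. To check the whole record, the full nucleotide sequence can be passed to `translate` and compared with the full protein sequence.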