Monarch geneset OGS2.0

DPOGS213852
TranscriptDPOGS213852-TA1254 bp
ProteinDPOGS213852-PA417 aa
Genomic positionDPSCF300361 - 136461-141349
RNAseq coverage276x (Rank: top 39%)
Annotation
HeliconiusHMEL0071333e-8776.67% 
BombyxBGIBMGA009658-TA0.081.32% 
DrosophilaRbcn-3B-PA7e-12864.53% 
EBI UniRef50UniRef50_Q16IV45e-15459.05%Tgf-beta resistance-associated protein trag (Fragment) n=1 Tax=Aedes aegypti RepID=Q16IV4_AEDAE
NCBI RefSeqXP_970256.23e-15959.96%PREDICTED: similar to AGAP008003-PA [Tribolium castaneum]
NCBI nr blastpgi|2700096823e-16461.59%hypothetical protein TcasGA2_TC008973 [Tribolium castaneum]
NCBI nr blastxgi|2700096821e-15862.15%hypothetical protein TcasGA2_TC008973 [Tribolium castaneum]
Group
Gene OntologyGO:00055151.6e-10protein binding
KEGG pathway 
InterPro domain[288-357] IPR0159431.6e-10WD40/YVTN repeat-like-containing domain
[322-356] IPR0197817.4e-06WD40 repeat, subgroup
Orthology groupMCL10625 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213852-TA
ATGCTCGCTCGTCGTTGGCAACATCACTGTCTCGAGGTGAGAGATGCTGCTCAGGCGCTGTTGTTGGCTGAATTGGGCAGGATGGGACCAAAAGGCCGTAAATCCCTGGTCGATAACTGGGCACAATATCTACCGCTTTACACTCACACGGAGAGCATCAACCCACAAGCAACCCAGAAGGAACCGGCTGAGAAAGTCGCTAAGGACCTAAATTTACCAAAAAAAATTACAGGTGCCGCAGGAAATAAGAGACGCAAAGACGGTGACAGTAGGAAGAGTTCCATCGTTGAAGGTTTCACACTTAGTTCATCGAACAACCTGGCCCGTCTGACGTCCTTGGCGCTAACTCACTTACTGCTGGCCCCTGGTTCCGCTCGCCTTCCCGCACACACGCCGCTACGACGTGCGGCCATTGACCTCCTCGGGAGAGGATTCGCTGTTTGGGAACCGTATCTCGACGTATCTCATGTGCTGCTGGGTCTGTTGGAGATGTGTTCGGACGCTGATAAATTAGTGCCTTCGATGACGTACGGTCTGCCTCTCACACCGCAGGCGGACTCCTGTCGGACCGCGCGGCACGCACTCACACTTATAGCGACAGCAAGACCAGCGGCTTTCATAACGACTATGGCCCGTGAGGTAGCGAGATGCGCTGCCGCCCCCGCGGGCGCTCCGCCCCCGCCCGCCGCGGTTGCGCTACAGCGAGGCAGGGCGGAGGTTTTACGGGGAATAGAACTCCTCATAGAGAGAATGCACGGCGCTGTGGCTGAACTGCTCGTGGAGGTGATGGATATAATTCTTCACTGCGTGGATCAGTCGCATCTGAAGAGTAAGGGTTTAAGTGAAGTCTTTCCAGCTGTTTGTCGCTACAACCAGGTCTCTCACTGCCCAGCCACCAGGAGGATTGCAGTCGGGAGTCACACCGGCCAGCTGGCTATCTACGAGCTCCGCGCCGCTCGCTGTCAGTCCCTGACGGCCCACGCGGGTCCAGTGACGGCCTGCGCCTTCAGTCCGGACGGAAGATACCTCGTCTCCTACGCCACGGCAGATAACAGACTCTCCTTCTGGCAGAGTACAGCTGGTATGTTTGGTCTCGGAGCTGCTCAGACTCGTTGCGTGAAGTGTTATAGCACGGCCCCGATGGCGGACGTCTCACGTCTGAACCCTCTTCACCTAGCCAGGCTAGTGTGGACCAACTCGAGGACTGTCACCTTAGTACTGGCGGACGGCTCGGAGACCAGATTCAATGTTTAA

Protein sequence:

>DPOGS213852-PA
MLARRWQHHCLEVRDAAQALLLAELGRMGPKGRKSLVDNWAQYLPLYTHTESINPQATQKEPAEKVAKDLNLPKKITGAAGNKRRKDGDSRKSSIVEGFTLSSSNNLARLTSLALTHLLLAPGSARLPAHTPLRRAAIDLLGRGFAVWEPYLDVSHVLLGLLEMCSDADKLVPSMTYGLPLTPQADSCRTARHALTLIATARPAAFITTMAREVARCAAAPAGAPPPPAAVALQRGRAEVLRGIELLIERMHGAVAELLVEVMDIILHCVDQSHLKSKGLSEVFPAVCRYNQVSHCPATRRIAVGSHTGQLAIYELRAARCQSLTAHAGPVTACAFSPDGRYLVSYATADNRLSFWQSTAGMFGLGAAQTRCVKCYSTAPMADVSRLNPLHLARLVWTNSRTVTLVLADGSETRFNV-