Monarch geneset OGS2.0

DPOGS200590
TranscriptDPOGS200590-TA1752 bp
ProteinDPOGS200590-PA583 aa
Genomic positionDPSCF300076 - 789232-794267
RNAseq coverage731x (Rank: top 18%)
Annotation
HeliconiusHMEL0038600.066.19% 
BombyxBGIBMGA011278-TA5e-17957.45% 
Drosophilaadp-PA6e-6243.59% 
EBI UniRef50UniRef50_A7T2824e-9840.37%Predicted protein n=6 Tax=Eumetazoa RepID=A7T282_NEMVE
NCBI RefSeqXP_001622036.18e-9940.37%hypothetical protein NEMVEDRAFT_v1g221254 [Nematostella vectensis]
NCBI nr blastpgi|2608079937e-10441.73%hypothetical protein BRAFLDRAFT_74529 [Branchiostoma floridae]
NCBI nr blastxgi|2608079932e-10140.68%hypothetical protein BRAFLDRAFT_74529 [Branchiostoma floridae]
Group
Gene OntologyGO:00055151.2e-18protein binding
GO:00054885.6e-18binding
KEGG pathway 
InterPro domain[424-509] IPR0159431.2e-18WD40/YVTN repeat-like-containing domain
[282-402] IPR0119905.6e-18Tetratricopeptide-like helical
[41-508] IPR0110466.8e-14WD40 repeat-like-containing domain
Orthology groupMCL14019 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200590-TA
ATGTTCAACGGTTATTATTCCAGTCAGCACGACCTTCGTACACCACACCGTTGTAATTCCGACTCGGCCAATGTGTTGGTGAACCTGTTGAACCACTTGGGCAAGTACGCTGAAGCCAAGTGTCTGGCGGTGAACCCTCGACGGCCATATCAGCTCGCGGTTGGAGCAAACGACTTCTACGTTCGACTCTACGACACCAGAATGATCAAATTAGCCAAATTACTGGAGACTCCGGCGGGCTCCGCGCCCTCGGGTTTGATGTGGGAGAGACAGAATGTGAGGTGCTCCCGTGCCGGCCACGGAGACCCCGACGAGAACATTCCACGTGAAGCTGTTCAGTACTACGCCCCCGGGCACCTGTCCATGGAATTGAATGAGAACACATTCCCAAAGAAAGCTACAACATATGTGGCTTTCAGTCACGATGGCAACGAACTCCTCGTCAACCTGGGATCGGAACAGGTGTATCTGTTCGATATCAACTCGGCCAGGCGTCCGGTGCTGGTGGAGAGCTTCATAATCCAGCACAATCACAGCCGTCGCGAGGAGGCCGCTCAGGAAATGGTTCCCCCAACAGAAACCAGCGGAGAGAACGGTACCATAGACCAGCAGGTCGTACTGCCCGACAAGATACAGCACCTGAAATCTGTGAGCGTGAAGTGCGGAAGGTGCGGGGCGGGGGAAAACAGTATAAACCGGTTCAAACCTAGGGAAGCACGTGGCACGGACAAGTTTGACATCACACCTACGCTGCTGGGTGCACTCACACTAATGCAAAGCTCAGCTAACGCCTCACGCGCTGTACAGCGACATCGTAATACAACTGTTATGTGCCGACCTCGAGGGAAGGCTAACGAGATGGTGAACAGCGGCAGTTTCTGTATGGCGGTGGACACCTACAACATGGCCATACACGAGTATCCAAACTGCGCCTTGCTCTACTCCAACAGGGCCGCCGCGCTCATGAGGAGGGGCTGGTCCGGTGACACGTACGCGGCTATACGAGACTGCTACCAGGCGATCAAGTTGGACCCCGGCCACGTCAAGTCCCACTTCCGGCTGGCCAAAGGCCTGATGGACCTGAAGCGGGCCCGCGAGGCCCACGAGTGTCTGTTGTATTTCAAGGACAAGTTTCCTAAACACGCCTCCAGTCACGCTGTGTTCCTGCTGCAGAAGGACATTAAAGTGGCGCTCAAGACCTTGGAGCGAGATGAAGGTGACGATGACAGCCAGGCCGGTAGTCCCCTGGAGCGCCAACTCCGGATGTCTTCGTTGGACTACTCGTCACGTTTCCTGGGCCACTGTAACACTACCACCGACATCAAGGAGGCGAACTTCCTGGGACCCAATGCTGGATTTGTGGCCGCTGGCCTGCTCGGCAGTATGTTCATCTGGTGCCGCCACACGGGGAACATCGTCCGCTGTCTCCGTGGAGACGAGTCGATAGTTAACTGCGTGCAACTGCATCCGTCCATGTTCCTGTTGGCGACCAGTGGCATCGAGGCCGTGGTTCGCCTATGGAGCCCGAGACCGGAAGACGGCTGTCCCGACACCAGGGCCGTCAGCGAAGCGAGCGCCGCTGCCGCCGCCAACCAGCAGCGGATGCGCTCTGACCCGTTTGAAGCTATGCTGCTCAATATCAGCTTTGCAGGAGGCGCAGACAGAGATGTTCACTCGCCGGCCTGTCGAGCTACGTTAATGAACTCACACCTGTGCGATGGAGAGCCCGGAGTCTGTGGCGATTGTAAATAG

Protein sequence:

>DPOGS200590-PA
MFNGYYSSQHDLRTPHRCNSDSANVLVNLLNHLGKYAEAKCLAVNPRRPYQLAVGANDFYVRLYDTRMIKLAKLLETPAGSAPSGLMWERQNVRCSRAGHGDPDENIPREAVQYYAPGHLSMELNENTFPKKATTYVAFSHDGNELLVNLGSEQVYLFDINSARRPVLVESFIIQHNHSRREEAAQEMVPPTETSGENGTIDQQVVLPDKIQHLKSVSVKCGRCGAGENSINRFKPREARGTDKFDITPTLLGALTLMQSSANASRAVQRHRNTTVMCRPRGKANEMVNSGSFCMAVDTYNMAIHEYPNCALLYSNRAAALMRRGWSGDTYAAIRDCYQAIKLDPGHVKSHFRLAKGLMDLKRAREAHECLLYFKDKFPKHASSHAVFLLQKDIKVALKTLERDEGDDDSQAGSPLERQLRMSSLDYSSRFLGHCNTTTDIKEANFLGPNAGFVAAGLLGSMFIWCRHTGNIVRCLRGDESIVNCVQLHPSMFLLATSGIEAVVRLWSPRPEDGCPDTRAVSEASAAAAANQQRMRSDPFEAMLLNISFAGGADRDVHSPACRATLMNSHLCDGEPGVCGDCK-