Monarch geneset OGS2.0

DPOGS214055
TranscriptDPOGS214055-TA1344 bp
ProteinDPOGS214055-PA447 aa
Genomic positionDPSCF300171 - 242072-244186
RNAseq coverage335x (Rank: top 34%)
Annotation
HeliconiusHMEL0128610.073.15% 
BombyxBGIBMGA010388-TA0.083.45% 
DrosophilaCG7275-PA2e-18064.04% 
EBI UniRef50UniRef50_Q9VUN73e-17864.04%CG7275 n=30 Tax=Diptera RepID=Q9VUN7_DROME
NCBI RefSeqXP_974788.10.073.03%PREDICTED: similar to GA20229-PA [Tribolium castaneum]
NCBI nr blastpgi|910768460.073.03%PREDICTED: similar to GA20229-PA [Tribolium castaneum]
NCBI nr blastxgi|910768460.073.03%PREDICTED: similar to GA20229-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055154.2e-59protein binding
KEGG pathway 
InterPro domain[14-355] IPR0110464.2e-59WD40 repeat-like-containing domain
[61-356] IPR0159433.9e-53WD40/YVTN repeat-like-containing domain
[356-443] IPR0072872.7e-34Sof1-like protein
[102-140] IPR0197811.4e-09WD40 repeat, subgroup
[101-140] IPR0016804.3e-08WD40 repeat
Orthology groupMCL13958 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214055-TA
ATGTCCAACTTGAAAATAAAAGTAATAAGTCGTAACCCAGAAGATTATCTGCGTTCCACTAAAAGAGATATTCATAAAATTCCTAGGAATTATGACCCGAGTCTGCATCCCCTTGAGGGTCCACGGGAGTATGTCAGAGCATTAAATGCGGTAAAACTAGAAAGAGTATTTGCGAAGCCTTTTCTTGGCAGTCTTGATGGACACTCGGATGGTGTATCTAGTTTGGGAAAGCATCCTAGCCGATTATCCGCTTTGGCTAGCGGAGCTTTTGATGGAGAAATTAGGATATGGGACTTAACTAGTCGTAAATGTACCAGAAATTTCATTGCACATGAAGGTTGGGTTCGTGCTATTTGCTACACACCAAACGGTCAACAGTTTATGAGTGTTGGTGATGATAAAACAATTAAAACCTGGAAAGCTGATATTCAAGACCCTGATGACGAAGATCCTGTTAATACACTTCTCAGCATGTCAGTGGTATCTGGTATTAGCCATCATAGAGCAAAACCAATATTTGCTACTTGCGGTGAACATTGTCAGTTGTGGGAAAATACTAGGAGTGAACCTGTCAAAGTATTTCAATGGGGAGTAGATAGCCTGCATCATGTTGCATTTAATCAGGTAGAAACAAATCTGTTAGCAGCATGTGCGAGTGATAGGAGCGTTATACTTTATGACTTCCGTGAGTCAGGACCTCTTAGGAAAGTAGTGATGGAACTGAGATCTAATGCACTATCTTGGAATCCCATGGAGGCATATATATTTACTGTAGCTAATGAAGACTATAACCTGTACACATTTGATATCAGAAAACTGAGACAACCAGTGAATGTTCATGTTGACCACACATCTGCGGTGATCGATGTGGATTATGCACCGACTGGGAGAGAATTTGTCGCTGGTAGCTATGATAAGACTGTTAGGATATTCGAGAGCCTTAAAGGACACTCCAGAGATGTGTATCATACGAAGAGAATGCAGAGATTGACATGTGTTAAGTGGACATTGGATAATAAATATATTTTGACTGGATCAGATGAAATGAATATAAGAATGTGGAAGGCTAGAGCTTCGGAGAAACTTGGTGTTCTCAAACCTCGAGAACGTACAGCTCTTAATTATTCGGAAGCTTTGAAGGAGAAATTCAGTGGTCATCCACAGATCAAACGTATAGCTCGTCACAGGCACGTGCCGAAACACATATTGAACGCTCAGAAAGAACTTCGTACTATCAAAGAGAAGAGCAAACGTAAAGAGGGCAACAAGCGCTCCCACAGCAAACCTGGAGCTGTGCCATTTGTACCTGAACGTAAAAAGCATGTCGTTAAAGAAGATGAGTGA

Protein sequence:

>DPOGS214055-PA
MSNLKIKVISRNPEDYLRSTKRDIHKIPRNYDPSLHPLEGPREYVRALNAVKLERVFAKPFLGSLDGHSDGVSSLGKHPSRLSALASGAFDGEIRIWDLTSRKCTRNFIAHEGWVRAICYTPNGQQFMSVGDDKTIKTWKADIQDPDDEDPVNTLLSMSVVSGISHHRAKPIFATCGEHCQLWENTRSEPVKVFQWGVDSLHHVAFNQVETNLLAACASDRSVILYDFRESGPLRKVVMELRSNALSWNPMEAYIFTVANEDYNLYTFDIRKLRQPVNVHVDHTSAVIDVDYAPTGREFVAGSYDKTVRIFESLKGHSRDVYHTKRMQRLTCVKWTLDNKYILTGSDEMNIRMWKARASEKLGVLKPRERTALNYSEALKEKFSGHPQIKRIARHRHVPKHILNAQKELRTIKEKSKRKEGNKRSHSKPGAVPFVPERKKHVVKEDE-