Monarch geneset OGS2.0

DPOGS202110
TranscriptDPOGS202110-TA2013 bp
ProteinDPOGS202110-PA670 aa
Genomic positionDPSCF300150 - 125197-139075
RNAseq coverage15x (Rank: top 81%)
Annotation
HeliconiusHMEL0080254e-12674.20% 
BombyxBGIBMGA006900-TA3e-5859.06% 
Drosophila% 
EBI UniRef50UniRef50_F4WP231e-3325.91%WD repeat-containing protein 38 (Fragment) n=1 Tax=Acromyrmex echinatior RepID=F4WP23_ACREC
NCBI RefSeqXP_002427200.12e-2223.50%hypothetical protein Phum_PHUM300870 [Pediculus humanus corporis]
NCBI nr blastpgi|3320237305e-3325.91%WD repeat-containing protein 38 [Acromyrmex echinatior]
NCBI nr blastxgi|3320237308e-3325.91%WD repeat-containing protein 38 [Acromyrmex echinatior]
Group
Gene OntologyGO:00055157.5e-29protein binding
KEGG pathway 
InterPro domain[78-473] IPR0110467.5e-29WD40 repeat-like-containing domain
[282-517] IPR0159432.1e-24WD40/YVTN repeat-like-containing domain
Orthology groupMCL19536 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202110-TA
ATGGCGTCTAATCTTTTTAATATATCAACATACAGTCGGTTTAATTTGTCAGAGAGTGAATCTGATATTCAAGTTTTTGTCAAAGAATATGGGTATCGCATAAAAAAACAACTTATTCGTGAGGATGAGAAGGGTCTCTCGGGCTTTGTTTCTTTTGCGGAGAGAAGTTTAAGTTTAACAGAATTGACTTGGTTACGAGACAATGAAGAAAGATTTTTTAAGGATACTCCTAATGAGAATTTGCTACGTTTGGACTATTTGTCCGAAACTGGCTTTATTACCTCTATGATATATAGTCCTGATGGCGAATTTTTAATCGTAGGTCACTCATCTGGTCTAATTCAAGAATCTCCAAAACTTTGGTATCCAATGTTTTATCAAGGTCCAAAAGAAGAGCACAATCGTTTCGCATGGATGAAAGAAAACAGTGATAGATTTTTAAAAGGCACTCGAAATGAGATATTGTTCAATCGATGGTATTGTTCCGAATTGGGCGCTATAACGGCTTTGACCTACTCGCCTAACGGAGGCCACATTATTGTAGGCCATGCGTCAGGGATGGTGCAGATGCGTCATGGTACTACAGGCGTAGTTCTCTGTACTCTGCGTAACATACAATTTCCCCCGAGACCCATCTATGCTATCGAATACAGTCGTTTAGAAGAACGTGTGTGCTATGCCGCCTGCAGCGATGGTGCCATATACAGAATTGAAATACCTAATATCGCAACATCAGTGGACGATCCTCCCGGGTATTGTATTCGTGCCGATCCTGCTTTGGAATGTCTTAACACACAGTTTTATGGCTCTCCTGGAATATCTTCCGCCTCTACCCCGCTTATAACACAGAGATCGCCGGCTCTATCCTTGGGCATATCGGCTGATCAGGCGAAAATCATTGTTGGGTACGCTGACGCCTCGATAAAAGTGTACGATATGGAAACTCTAGAGGCGGACCTAACATATAAGGTCCACAAACTGCGTTTACAGTTTATACCTAAGAAACTTCAGAGAATGCATTTTGGGCAAGTCTGTGCTCTAAGGTGCCACGACCAGAAGCCTTTTATATTTGCATCAGGAGCCTGGGATAATACTTTGAGGATTTGGGATATAAGATGCCAAGTTGGATGCATAATGACTTTCGAAGGAGTAAATATATGTGGTGACAGCATTGATCTGAATCGGGATTATTGCATTTCTGGAAGTTGGCAACCAACTGAAGCGCTGTCGGTTTGGGATTTGACTGCCAAAAAGAGGTTGAGCACCATCAGAGTACAAAACCGACGACCTGATGTTGATGGGGAGTATATTTACGGTTGCCGCTATTGGAGATCTTCAAAGTATAACCGTAAGGGCAAATACGGTATCATAGGCGGTAGTGGAACAAACTGTGTTGAGGTCATCAATCTTCACAATCGCTACATAGCCTGCTCCTATCCGTGTCCTGGAACAGTACTAGCAATAACCAGTCACGAAGAAAGAATAGCGTTCGGAGGTACAGCTCCTTTATTGAACATTGTCAGCTTCCATGATCCAAAGCACGAGAAATATAAAATAAAAGAGGAACAACAATTCGAATTTACCGTGAAAGATGTTACGTGGTTCGACGAAGATCAAAGTTCTGAAGAAGTTGTTGAACATCCACCTAAGTGCGAACACATGCTGGCGCTGGTGAACTTTCAACTTTTACTCGCCCATGTAGCTGGAGACTGTACCCACCTGACCCCCCATGGGAGGGGGCGCTTTCCGTACTTGGAGCGAATATTTACAGCGCAACCTAAATATAGCGTGCCGCTGCTCAGCTGGCACGGTAGAGGAGGGGATAGGCGGGTAAACATTACCCCTCCACTCCTAACCAACGGAAGGCCATTCGGGCGTCGGGCACTGAGGTGTTCGGACGTCTACATTATATGCTTGCGTATAGCGACCATGACCGACAGGGGTGGACGCTCGCGCCAGCAACAAACCTTCTCAAGGTTACGACGCCATTGTATAAGACCACTACAGAATTAA

Protein sequence:

>DPOGS202110-PA
MASNLFNISTYSRFNLSESESDIQVFVKEYGYRIKKQLIREDEKGLSGFVSFAERSLSLTELTWLRDNEERFFKDTPNENLLRLDYLSETGFITSMIYSPDGEFLIVGHSSGLIQESPKLWYPMFYQGPKEEHNRFAWMKENSDRFLKGTRNEILFNRWYCSELGAITALTYSPNGGHIIVGHASGMVQMRHGTTGVVLCTLRNIQFPPRPIYAIEYSRLEERVCYAACSDGAIYRIEIPNIATSVDDPPGYCIRADPALECLNTQFYGSPGISSASTPLITQRSPALSLGISADQAKIIVGYADASIKVYDMETLEADLTYKVHKLRLQFIPKKLQRMHFGQVCALRCHDQKPFIFASGAWDNTLRIWDIRCQVGCIMTFEGVNICGDSIDLNRDYCISGSWQPTEALSVWDLTAKKRLSTIRVQNRRPDVDGEYIYGCRYWRSSKYNRKGKYGIIGGSGTNCVEVINLHNRYIACSYPCPGTVLAITSHEERIAFGGTAPLLNIVSFHDPKHEKYKIKEEQQFEFTVKDVTWFDEDQSSEEVVEHPPKCEHMLALVNFQLLLAHVAGDCTHLTPHGRGRFPYLERIFTAQPKYSVPLLSWHGRGGDRRVNITPPLLTNGRPFGRRALRCSDVYIICLRIATMTDRGGRSRQQQTFSRLRRHCIRPLQN-