Monarch geneset OGS2.0

DPOGS201063
TranscriptDPOGS201063-TA945 bp
ProteinDPOGS201063-PA314 aa
Genomic positionDPSCF300185 - 382157-384072
RNAseq coverage541x (Rank: top 23%)
Annotation
HeliconiusHMEL0098453e-11484.94% 
BombyxBGIBMGA007152-TA2e-15384.81% 
DrosophilaCG3313-PA2e-11760.94% 
EBI UniRef50UniRef50_Q9VGE32e-11560.94%CG3313 n=26 Tax=Neoptera RepID=Q9VGE3_DROME
NCBI RefSeqXP_973684.12e-13068.14%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910765303e-12968.14%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|910765301e-12468.14%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00055151.1e-22protein binding
KEGG pathway 
InterPro domain[18-298] IPR0110461.1e-22WD40 repeat-like-containing domain
[17-261] IPR0159432.4e-19WD40/YVTN repeat-like-containing domain
[63-97] IPR0197813.6e-06WD40 repeat, subgroup
Orthology groupMCL13705 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201063-TA
ATGGTGTATGACGTCCGCTCGCGGTCCCTGGACGCGATCCCCACTCTGAGGCCGAGGGCTCCCTGGCCGGGGGCGGAACACCAGACAGGCATACACGCGCTCCAGATCAACCCCTCCCGTACTCTGCTAGCAACGGGAGCCCGGAACTCGTGCGAGCTGGCCATCTATAGCCTGCCGACCCTCGACCCCGTGTGTGTAGGGGAAGCTCACAAAGACTGGATCCTGGACATGTGTTGGCTGGACGACGAGTTCCTGGTGACGGGTTCCCGGGACTCGCGCCTGGCTCTGTGGCGCGCACCGCCCAGCCCGCTCGCCCGCCCAGGGCCCGCCGCCCATCAACACTACGTAGCGCCGGTCGCCGTCAGAGAGTGCCGCGCTGGTCAGAAGGTCCGCGCGTTGACCTTCAACGCCAAGTGGCGTGAGATCGCCGCCCTCACCCTGAACGGCTACATCCACGTGTTCAGCGCGCACTCCTTCCGGCAGACGCTCTCACGGAAACTGCCCTCGTGCCAGGACCTGGTGTGTCTCGCTACACAGGAAAGCGGCATCTACGCGATCGGCTGCAAGTCCTACACCCTGCTGCTAGACTGCCGCACGCTGCAGTCCGTCAAGAAGATAACCTCGAGGTACAACGGCTGCGGCATCCGCTCCGCGAGCTTCAGGGGCGACATGCTCACTATAGGAACAGGCCTCGGTCTTCTGATGTTCTACGACCTGCGAGCCGGGAAATATCTGGAAAGCAACATTCACTCGTCGAAAATCGTCACCCTCAAAGCGTCCAGGGGCTACGTGTTCCCGGACGAGGAAGCGGATGGTTTCAACCAAGGCAAGTACGTGCCAGCGATATACACTCACTGCTACGACGACTCGGGCACGCGCATCTTCACGGCGGGCGGGCCGCTGCCGGCGCCGCTCGTCGGGAACTACGCCGGCCTCTGGCAGTAG

Protein sequence:

>DPOGS201063-PA
MVYDVRSRSLDAIPTLRPRAPWPGAEHQTGIHALQINPSRTLLATGARNSCELAIYSLPTLDPVCVGEAHKDWILDMCWLDDEFLVTGSRDSRLALWRAPPSPLARPGPAAHQHYVAPVAVRECRAGQKVRALTFNAKWREIAALTLNGYIHVFSAHSFRQTLSRKLPSCQDLVCLATQESGIYAIGCKSYTLLLDCRTLQSVKKITSRYNGCGIRSASFRGDMLTIGTGLGLLMFYDLRAGKYLESNIHSSKIVTLKASRGYVFPDEEADGFNQGKYVPAIYTHCYDDSGTRIFTAGGPLPAPLVGNYAGLWQ-