Monarch geneset OGS2.0

DPOGS211713
TranscriptDPOGS211713-TA1392 bp
ProteinDPOGS211713-PA463 aa
Genomic positionDPSCF300423 + 37000-47175
RNAseq coverage1138x (Rank: top 11%)
Annotation
HeliconiusHMEL0096763e-9950.00% 
BombyxBGIBMGA008706-TA7e-10274.50% 
Drosophilaflr-PA5e-9466.54% 
EBI UniRef50UniRef50_C3YJ601e-9741.27%Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YJ60_BRAFL
NCBI RefSeqXP_001661907.15e-9667.83%wd-repeat protein [Aedes aegypti]
NCBI nr blastpgi|2608176754e-9741.27%hypothetical protein BRAFLDRAFT_126885 [Branchiostoma floridae]
NCBI nr blastxgi|2608176758e-9741.27%hypothetical protein BRAFLDRAFT_126885 [Branchiostoma floridae]
Group
Gene OntologyGO:00055157.8e-53protein binding
KEGG pathway 
InterPro domain[26-459] IPR0110467.8e-53WD40 repeat-like-containing domain
[231-461] IPR0159431.9e-30WD40/YVTN repeat-like-containing domain
Orthology groupMCL13835 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211713-TA
ATGTCATATTCAAATAAGTATACGTTTGCGGCTTTGCCGAGAACACAGCGCGGTACCCCTTTAGTGTTGGGAGGTGATCCTAAAGGGAAGCATTTTTTGTATACGAACGGGAATTCGGTAATTATAAGGGATATCGAAAATCCGGCGATATCTGATGTATATACCGAACATTCGTGTCAAGTTAACGTCGCGAAGTACTCACCGAGTGGTTTTTACATCGCGTCTGGAGATGTTTCCGGCAAAGTTCGTATATGGGATACCGTGAACAAGGAACACATTCTCAAGAATGAGTTCCAGCCAATCGGTGGACCGATCAAGGATATCGCCTGGAGTGCTGACAGCCAGCGAATGGTGGTCGCTGGTGAAGGAAGGGAACGCTTCGGACACGTGTTCATGGCTGAGACGGGTACATCGGTCGGGGAGATCAGTGGACAGTCAAAGCCAATCAACTCGGTCGATTTCCGGCCCGCGAGGCCCTTCAGGATCGTCACCGCGTCGGAGGACAACACGCTGGCCGTGTTTGAAGGGCCCCCCTTCAAGTTCAATGTCAAACGCTTTGCTAAAGCTGTCAAGAAAACGATTAACCACCCTTACGTCAACAATGGAACATTATGTGGCTATGTTGGTGAGGCTGTGCACATTTCCGGTCCAGGTCACGGCAACCAGGTGAATGGTATGAAGGCTTGCGAGGACGGCAGTCTATTGACATGCGGCATAGACGATACCTTGAGGAAGGCTGTCCCGGCTAGCGAAGGGGACATACCAACATATTCGTCTGATGCCACACCTCTGGGATCTCAGCCGAAGGCCTTGGATCATCTGGAGGGTGAAGGAACTACAGTTGTGGCTACCGTCAAAGAGCTCCTAGTACTGAAAGGCAATGTTAAGCAATCATCGCTGTCTCTAAGCTATGAACCGAGCTGTGTCACAATAGACCCAGTGTCCAAACACGTTGCTGTTGGTGGTGATGACAACAAGGTCCACATATATTCCCTATCGGATCTGTCACTCATCAATGAGCTGGAACACCTGGGCCCGGTCACAGATGCTCGGTACAGTCCCAACAGTCGCTACTTAGTGGCCTGCGACGCTCACAGGAAGATCATACTATATACTACTGAGGAATATAAGCTAGCTCACAAAGCCGAGTGGGGCTTCCACACGGCCCGCGTGAACCGTGTCGCCTGGAGCCCTGACAGTTTGAGAGTTGTCTCCGGGGCCCTAGACACCTGCCTCATAGTATGGAGCGTCGCTGCTCCGACCAAGCATATCGTTATTAAAAACGCTCACCCACAGAGCCAGATCACGGGTGTGTCGTGGGTCGACGACGAGACCATCGTGTCGGTCGGTCAGGACGCCAACACGCGCGTGTGGTCGGTGCCTAACGCTTAA

Protein sequence:

>DPOGS211713-PA
MSYSNKYTFAALPRTQRGTPLVLGGDPKGKHFLYTNGNSVIIRDIENPAISDVYTEHSCQVNVAKYSPSGFYIASGDVSGKVRIWDTVNKEHILKNEFQPIGGPIKDIAWSADSQRMVVAGEGRERFGHVFMAETGTSVGEISGQSKPINSVDFRPARPFRIVTASEDNTLAVFEGPPFKFNVKRFAKAVKKTINHPYVNNGTLCGYVGEAVHISGPGHGNQVNGMKACEDGSLLTCGIDDTLRKAVPASEGDIPTYSSDATPLGSQPKALDHLEGEGTTVVATVKELLVLKGNVKQSSLSLSYEPSCVTIDPVSKHVAVGGDDNKVHIYSLSDLSLINELEHLGPVTDARYSPNSRYLVACDAHRKIILYTTEEYKLAHKAEWGFHTARVNRVAWSPDSLRVVSGALDTCLIVWSVAAPTKHIVIKNAHPQSQITGVSWVDDETIVSVGQDANTRVWSVPNA-