Monarch geneset OGS2.0

DPOGS210333
TranscriptDPOGS210333-TA1887 bp
ProteinDPOGS210333-PA628 aa
Genomic positionDPSCF300025 - 473472-479878
RNAseq coverage241x (Rank: top 43%)
Annotation
HeliconiusHMEL0138350.074.48% 
BombyxBGIBMGA011974-TA6e-11879.27% 
DrosophilaCG8270-PB2e-9735.05% 
EBI UniRef50UniRef50_F4WH298e-12138.71%Uncharacterized protein C18orf8-like protein n=8 Tax=Formicidae RepID=F4WH29_ACREC
NCBI RefSeqXP_001607978.11e-12038.75%PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3838643082e-12138.56%PREDICTED: uncharacterized protein C18orf8-like [Megachile rotundata]
NCBI nr blastxgi|3838643085e-11538.62%PREDICTED: uncharacterized protein C18orf8-like [Megachile rotundata]
Group
Gene OntologyGO:00055152.8e-06protein binding
KEGG pathway 
InterPro domain[456-614] IPR0097552.8e-30Colon cancer-associated Mic1-like
[23-186] IPR0110462.8e-06WD40 repeat-like-containing domain
[244-273] IPR0110422.9e-06Six-bladed beta-propeller, TolB-like
Orthology groupMCL14082 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210333-TA
ATGGCAAAATTTTGTTTAAAAGATAGTGAAAAATATTACCTCACACTTTCGGAACGACCTAAAAGATTCCACGCTGATAGCCCTGTAACTAACGTGTTCTTTGACGACACTAATGGACAAGTATTCACAGTTAGATCAGGCGGAGTAACTGGAGTCACGGTCAATGGCATGGACGACTCTAAATGCACATCATTTAGAATGGAAGACAAAGGTCCCATCATCTCTATAAAATTTTCTCCAGATCATAAAGTTCTAGCAATACAAAGGAATCTTGAAGGTCAAAATGCCACAGTCGAGTTTGCCAACTTCAAGGATCTCATGCCAACAAATGTTGAATACTGTCATATATGTAAATGGAAAAACGCGAAAATATTAGGTTTTGTGTGGCCAAAAGCCAACGAAATAGCTTTTATAACAGACCATGGCATAGAATTGTCACAAGTACAGCCAGAAAAGAAACAATTGAAGACATTAAAGAGCACTTCGTTCAGCGCTGCCTGGTTTTCTTGGTGTCCACAGAGCAATATAATATTATTGGCCGGCAACAATGGAATTCTGCTTCAACCTTTCTCTCTGAACAACTCTACTATAACAAAGCTGCAGAAGTTGGAAGTGGAGTCGTCTCGGCCGGTTATGGAGCGTGATGTGTTTGTGTTACGTCTCTGTGGTGCCGCTTGGTGTGCACTCTTCAAACATGGACCTGTCTCCAACACCACATCCAGTCCTGGACCCACTGAGGTATGGTTGGTCCCCATATCTGGTGGAGGAGCAGTCCAGCACGTCCTCAAGACTGGGCTGGTGGGGAGGTTCGCTGTCAGTGTGATTGATGACCTAGTGGTCATTCATCACCAGTCCAGTCAGACCTCACTCATATTCGACATAATGGAGGAAGCGAAGCCAGAAAACAACACAGTAGTCCACAAACCTATAGTACAGGGTATATCAATGAGACCAGCTGTCGTTGATGAACAGACCTGTCCGATGTACTCCGGTAACTGGGTGGTGTTTCAACCACATTATGTGATCGATGCCAGGCTCGGTTGTCTGTGGAGGCTGGAGCTCAACTTAGCTGGCCTGGCGCATGAGGTTCCAAAGGAGGATATCTCCAACATAGTCGGTGTTCTGTTGAGAAGGAACAACAGCAAGGAAACCATCTACAGGATCCTGAACCAGCTCGTGGAACACGCTGGGACCTACCTCATGGAACTCACTCACTGCTTTGATGAGATCAACGCTGTCTATAGACGATGGGCGGATTTGGAGGTGGCCCGCAACACGGCAGGCTGTCCGCCGCCCGCCCAGGGACACTTCCCCGCCCTCGTGTCGCAGGCTGACATGTGCACACACGTCCTGTACAAACACTCCCACACACTACTAGTGCAGGTGGTGACGGCATACTTGTCGTCTCTCTCCCGTTACGAGCTGGTGGTGCAGCACGCGGTGTGCGAGCTGGTGGTGCGGTCGCTGGTGAGGGCTGGCGAGGGCGGGCGGCTGCGGGCGCTGGTGAGGCGCGGCGCCCTACAGGACGGGCGACCGCTCGCCTGCCAACTGCTCAGCCTGGGACACTTGGACCCCGCCGCCGCGCAAGTGGCGCTCGACATGATGTGGAGGCTGAGGGCTTATGGGGAGATAGTGGAGGTGTTGCTGAGTCGCGAGGAGCCGGTGAGCGCCGCTGGTGCCGCTCGCCAGGCCGGCGCCTGGGGTTCGCTGGCGGCGAGGAAACTGCTGTCCGCGGCCCAGCAGCACTCGCGGCCCGAAACATTCCTCGCCATATACCACGCACTGAGAACCCGCAACGAAAGACTGAGAGGAACACCGGACTTCCTCACTGAGGAACAATGCGGAGCTTACATCGAGTACTACAAACAACTCATGTCCGAGTCTTGA

Protein sequence:

>DPOGS210333-PA
MAKFCLKDSEKYYLTLSERPKRFHADSPVTNVFFDDTNGQVFTVRSGGVTGVTVNGMDDSKCTSFRMEDKGPIISIKFSPDHKVLAIQRNLEGQNATVEFANFKDLMPTNVEYCHICKWKNAKILGFVWPKANEIAFITDHGIELSQVQPEKKQLKTLKSTSFSAAWFSWCPQSNIILLAGNNGILLQPFSLNNSTITKLQKLEVESSRPVMERDVFVLRLCGAAWCALFKHGPVSNTTSSPGPTEVWLVPISGGGAVQHVLKTGLVGRFAVSVIDDLVVIHHQSSQTSLIFDIMEEAKPENNTVVHKPIVQGISMRPAVVDEQTCPMYSGNWVVFQPHYVIDARLGCLWRLELNLAGLAHEVPKEDISNIVGVLLRRNNSKETIYRILNQLVEHAGTYLMELTHCFDEINAVYRRWADLEVARNTAGCPPPAQGHFPALVSQADMCTHVLYKHSHTLLVQVVTAYLSSLSRYELVVQHAVCELVVRSLVRAGEGGRLRALVRRGALQDGRPLACQLLSLGHLDPAAAQVALDMMWRLRAYGEIVEVLLSREEPVSAAGAARQAGAWGSLAARKLLSAAQQHSRPETFLAIYHALRTRNERLRGTPDFLTEEQCGAYIEYYKQLMSES-