Monarch geneset OGS2.0

DPOGS213468
Transcript | DPOGS213468-TA | 1368 bp
Protein | DPOGS213468-PA | 455 aa
Genomic position | DPSCF300100 - 303801-306461
RNAseq coverage | 36x (Rank: top 74%)
Annotation
Heliconius | HMEL016839 | 7e-115 | 69.84%
Bombyx | BGIBMGA004492-TA | 2e-57 | 78.57%
Drosophila | CG11253-PA | 1e-68 | 33.12%
EBI UniRef50 | UniRef50_E0VHA0 | 4e-78 | 38.13% | Zinc finger protein MYND domain-containing protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VHA0_PEDHC
NCBI RefSeq | XP_001182407.1 | 6e-79 | 34.60% | PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
NCBI nr blastp | gi|380011327 | 5e-84 | 38.24% | PREDICTED: zinc finger MYND domain-containing protein 10-like, partial [Apis florea]
NCBI nr blastx | gi|380011327 | 1e-84 | 38.24% | PREDICTED: zinc finger MYND domain-containing protein 10-like, partial [Apis florea]
Group
Gene Ontology | GO:0008270 | 1e-09 | zinc ion binding
KEGG pathway | tet:TTHERM_01008630 | 2e-53
  K06045 (E5.4.99.17, sqhC, shc) maps-> Steroid biosynthesis
InterPro domain | [411-447] IPR002893 | 1e-09 | Zinc finger, MYND-type
Orthology group | MCL12366 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213468-TA
ATGGCTGAAAAGGAGGTACCATTAAGCGCTTTAGATGCAGGGGAATTAGAGTTGTTTATACAGAGTATGACATCCTGGCCGATAGAAGCTATCGGCAACCAAGCGTGGACTGATTGGCATATACGACTTCAAAAATTAAATCAACAAGCAGTCTTAGAAGCGTCTACTATGCAAGAAGAACTAACAAAAGAGACTCTTATATCTTATGGCAAGCTACCGAATTTAGTTTATGAAGTTATATGTATTCAAGTATGGCGTCTCAAAATATACCCACAGATTATGAAACTTGAACCGGCGCCTGCAAACACTTTTGGTATATATATGGTTTTGTACCACGAAGCAGCTGCAGTGGGACTTCTCGAAACTGTATTATTTCACGAAGATGGCGCTCAATGTATCAGCGAAGTAGTGATAGATCTTCTTAATTATTCCGTTGACCAATTAACAGCTCTGGTTGCTCTTATCAATAAAGGATATTTAAAGCCTGTTACTGCCAAAGAGTTGGATTGTGAAACGACTCCAGAAGAATTGGAACGACAAAAGAATGATCTCCAGTTCGATATAAGCATGAGATGTATTTCTATTGTGCGTTACATAGCCGAGCATATGGAAGTGGCGGGAGTGGGCGCTTCGATTTCAACAAATTTATATAAAACTTTCGATGTGCCTTCTATTTTGTGCCATCTTCTACAGTTGGAGCCATGGAAGAAAGTCAATGACAATGGAGATATGCAGATATTTAATTTCGGTCGTTGGTCGAAACCATCAGCCGACGATCTTTCTCAACTTCATCGCAGTGAGGCGCAGTTGTGGTTGTGTCTAAGACAACTTTTACTGGAACCGCGTCTATCCCATTACTATACTATCGACGAGTGCCGACGTTCAGCTTTTTGCGCGCTGCAAGCGAAACTAAGTGACGCCGTATTAGACCAAGTTCCACCTTTGGGAGACCTGAAGATGTTCCTTTGTCGTCTTGCAGTAGGAGATTATTCTAGTCTTCATACCCGGACTAACGGCGTCAAAAACCCAGGTTGTACGCTGATTGAGGTCGTGCCACAGATTAAAGAAAAATATCTCAAAGAAGTACACAAGAAGACTAAAACGTTGGCCAAGGCGCAATTGGATCATTTTAATATGGACGGGTCTGATGCGTCAAGGAATATGGCCAAGAAACTACTGGAGTCGTATACTAGCGATGCGGCGCTAGCGTTGGACAGTGGGGGGGCGAAGTGCGCCAAATGTGGCGACAAGGCTAGCAAGAAGTGTTCGAGATGCAAGACGGAGTGGTATTGTGGCAGGGAATGTCAAGTGAAGCAGTGGCCGAAACATAAGGACATTTGCGACCAATTCGCTAAACTCTGCGTTTAA

Protein sequence:

>DPOGS213468-PA
MAEKEVPLSALDAGELELFIQSMTSWPIEAIGNQAWTDWHIRLQKLNQQAVLEASTMQEELTKETLISYGKLPNLVYEVICIQVWRLKIYPQIMKLEPAPANTFGIYMVLYHEAAAVGLLETVLFHEDGAQCISEVVIDLLNYSVDQLTALVALINKGYLKPVTAKELDCETTPEELERQKNDLQFDISMRCISIVRYIAEHMEVAGVGASISTNLYKTFDVPSILCHLLQLEPWKKVNDNGDMQIFNFGRWSKPSADDLSQLHRSEAQLWLCLRQLLLEPRLSHYYTIDECRRSAFCALQAKLSDAVLDQVPPLGDLKMFLCRLAVGDYSSLHTRTNGVKNPGCTLIEVVPQIKEKYLKEVHKKTKTLAKAQLDHFNMDGSDASRNMAKKLLESYTSDAALALDSGGAKCAKCGDKASKKCSRCKTEWYCGRECQVKQWPKHKDICDQFAKLCV-
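
Consistency check (illustrative): the record lists a 1368 bp transcript and a 455 aa protein, which agrees if the transcript is a complete coding sequence of 456 codons (455 amino acids plus one stop codon). The sketch below is one possible way to verify this, assuming the standard genetic code and assuming that the trailing "-" in the protein sequence marks the stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database, and only the first and last few codons of DPOGS213468-TA are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# record uses the standard code; "*" marks a stop codon in this table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length arithmetic from the record: 1368 bp -> 456 codons -> 455 aa + stop.
transcript_bp = 1368
protein_aa = 455
codons = transcript_bp // 3
length_consistent = (transcript_bp % 3 == 0) and (codons == protein_aa + 1)

# First 7 and last 5 codons copied from the DPOGS213468-TA sequence above.
start_peptide = translate("ATGGCTGAAAAGGAGGTACCA")
end_peptide = translate("AAACTCTGCGTTTAA")
```

Under these assumptions, `start_peptide` and `end_peptide` can be compared with the beginning and end of DPOGS213468-PA; passing the full nucleotide sequence to `translate` would extend the same check to the whole protein.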