Monarch geneset OGS2.0

DPOGS215273
TranscriptDPOGS215273-TA1791 bp
ProteinDPOGS215273-PA596 aa
Genomic positionDPSCF300047 + 507905-511700
RNAseq coverage106x (Rank: top 60%)
Annotation
HeliconiusHMEL0080181e-6330.58% 
BombyxBGIBMGA008839-TA0.064.97% 
DrosophilaCG8503-PA6e-10737.11% 
EBI UniRef50UniRef50_D6WUY04e-12742.29%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WUY0_TRICA
NCBI RefSeqXP_973542.26e-12842.42%PREDICTED: similar to CG8503 CG8503-PA [Tribolium castaneum]
NCBI nr blastpgi|1892398481e-12642.42%PREDICTED: similar to CG8503 CG8503-PA [Tribolium castaneum]
NCBI nr blastxgi|2700119012e-13141.95%hypothetical protein TcasGA2_TC005992 [Tribolium castaneum]
Group
Gene OntologyGO:00082701.5e-09zinc ion binding
KEGG pathway 
InterPro domain[7-43] IPR0028931.5e-09Zinc finger, MYND-type
Orthology groupMCL17739 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215273-TA
ATGGCAGTGGAGGAGCCTTGCGAAGTATGTTTGACGCCAACAGAGCAAAAGTGTTCAGGGTGTCAGATAGTCCATTATTGTTCGAGGGACCATCAAAAACGACATTGGAAGCTACACAAGCTTCATTGTGTTCCAGCCAAGATCAAAGAACTTCCCGGCATAGGGAGATATCTTGAGGCAACAAGAACTGTAAAGGCCGGCGATGTTATAATGAAAGAAGCTCCTCTTATTACTGGTCCAGCTCAAATAACCCCACCAGTGTGTCTGGGGTGCTACGTGTTATTGGAAGAGGGTAAAACGACGCGTTGTGATAAGTGCGGCTGGCCATTTTGTTCTGAAAAATGTACCCATAATTCAGTTCACGAGCCAGAATGTTTTTATACCCAACAAAGAGGTGAAAAGATAACGCCACCAGTGTGTCTGGGATGCTACGTGTTATTGGAGGAGGGTAAAACGACGTCTTGTGATAAGTGCGGCTGGCCATTTTGTTCTGAAAAGTGTACCCATAGTTCAGTTCATGAGCCAGAATGTTTTTATACCCAACAAAGAGGTGAAAAGGTGTCGATATCAACGTATGGAATACCACATCCTAATTACCAATGTGTCACGGTATTACGGTGCTTGTACCAACGAGATCACAACATGAAGCTGTGGAATAAGCTGCAAGCACTGGAGACACATACAGAGGATAGGAAGGGCACTGATAAGTGGGAAAACGATAGAAAGATGGTAGCACAATTTATTTGGAACTTTTTCAAATTGGAACGACTATTCAATGAGGAAGAGATTATGAGATGCTGTGGCATTATTCAGATTAACGGCCACGAGGTACCGCTTCTTGAACCGGAATATGTGGCTGTGTTTGACAAAATTTCTATGGTAGAGCACAATTGTAGGGCCAATTGTAACAAAAGCTTCACCTCTAATGGCGAAATCATCCTTAGTGCTGGTGTTGCCATTCCTCGAGGCTCCCATATATCGGTCTGCTACACGGACCCGCTTTGGGGCACGGAAGCGCGCCGACACCATCTCTCCGATTCTAAATTTTTTGAGTGCTCTTGCGAGAGATGCTCTGATGTCACCGAGTTCGGAACCATGTTTAGTGCTGTGAAGTGTAAAAAGAAAAATTGTAAAGGATATCTTTTACCAGACACATTCATTCTACCAATTTTACATAAAACTATATCACCGGATCCCGCAACTAAAGGATTGGATAATAAGTTTTGGAAATGCAAAGAGTGTAAGGATGAGGTTTCAGATGAAATTATACAGCAACTGCTTCAAGATATAGGAAGAGAATTGAGCATTATGGAGAAAGGGAATCCAGATGCATGTGAGAGGTTTATAGAGCACTGTGCAACCTACTTACATCCTTCACACTACTACATGATAGATGCAGGACTAGCGCTGGCACAGCTGGTGGGTCAAGACACAGGTCTAGCAGTTGTCTCAGACGATCGACTGCTGCTCAAAACGCAGCTGTGTCGGAAGATCACCGAGTTGTTGAATATCTTAGCGCCAGCTGAGAGCAGGGTACGCGGTTCCCTTCTGTTCGAGTTACACGCGGCCGTGGCTGAAACCGGACGAAGGAAAGGTCTTGTGGAAGGACCCAACGTCATGCTGGGATACATTTTGGAGTCGCAGAAAATCCTGGTGGAGTCATCAGCATTGCTGGCACACGAACCACCAGAGTTACCTGAAGGCCGCCTGGGAAGACAGGCGAAAGTAAACCTGGTGCAGATGGACGAGCTTATTAGGAATTTATCCGCAGCCCTGCCCTCGCCTATATAA

Protein sequence:

>DPOGS215273-PA
MAVEEPCEVCLTPTEQKCSGCQIVHYCSRDHQKRHWKLHKLHCVPAKIKELPGIGRYLEATRTVKAGDVIMKEAPLITGPAQITPPVCLGCYVLLEEGKTTRCDKCGWPFCSEKCTHNSVHEPECFYTQQRGEKITPPVCLGCYVLLEEGKTTSCDKCGWPFCSEKCTHSSVHEPECFYTQQRGEKVSISTYGIPHPNYQCVTVLRCLYQRDHNMKLWNKLQALETHTEDRKGTDKWENDRKMVAQFIWNFFKLERLFNEEEIMRCCGIIQINGHEVPLLEPEYVAVFDKISMVEHNCRANCNKSFTSNGEIILSAGVAIPRGSHISVCYTDPLWGTEARRHHLSDSKFFECSCERCSDVTEFGTMFSAVKCKKKNCKGYLLPDTFILPILHKTISPDPATKGLDNKFWKCKECKDEVSDEIIQQLLQDIGRELSIMEKGNPDACERFIEHCATYLHPSHYYMIDAGLALAQLVGQDTGLAVVSDDRLLLKTQLCRKITELLNILAPAESRVRGSLLFELHAAVAETGRRKGLVEGPNVMLGYILESQKILVESSALLAHEPPELPEGRLGRQAKVNLVQMDELIRNLSAALPSPI-