Monarch geneset OGS2.0

DPOGS204412
TranscriptDPOGS204412-TA1608 bp
ProteinDPOGS204412-PA535 aa
Genomic positionDPSCF300002 - 669249-672537
RNAseq coverage208x (Rank: top 46%)
Annotation
HeliconiusHMEL0062690.067.59% 
BombyxBGIBMGA007717-TA1e-15352.69% 
DrosophilaCG4709-PA4e-7533.27% 
EBI UniRef50UniRef50_E2BTC92e-9439.38%Zinc finger CCCH-type with G patch domain-containing protein n=2 Tax=Formicidae RepID=E2BTC9_HARSA
NCBI RefSeqXP_624406.14e-8937.48%PREDICTED: similar to zinc finger, CCCH-type with G patch domain [Apis mellifera]
NCBI nr blastpgi|3838484489e-9840.26%PREDICTED: zinc finger CCCH-type with G patch domain-containing protein-like [Megachile rotundata]
NCBI nr blastxgi|3838484485e-10139.85%PREDICTED: zinc finger CCCH-type with G patch domain-containing protein-like [Megachile rotundata]
Group
Gene OntologyGO:00056222.8e-09intracellular
GO:00036762.8e-09nucleic acid binding
KEGG pathway 
InterPro domain[336-374] IPR0004672.8e-09D111/G-patch
Orthology groupMCL15867 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204412-TA
ATGGAAGAGTTAGCTTCCTCTATAAATCAATATGAAGACCAACTCTCAGTCGTTAGACAAGCACTGCAAGCTACCCAGGATGCTAATGAACGCGAATCTCTTTCTGTATTGCAATCAGAACTACAAGAGCTTATCAGTCTTACTAAGGAAAGTATGAACGTACAACAAAATGATAACAATCAATGTGACAATGCAACAGAATCTAATGGACTTGATGAAGAATATGCACTCTTTATGAAAGAGATGGATGAAAGTGATACATCAAATGTTAAAGACCACAAAAACACTACAGAAGAGGACAAAAATAGTGATATAGAAGATGAACTGGCTAGTCTTCTTGGGATGAAATGTTCTGTGTATCATACACACACTTGGGGAGGTCAACCCACCTTACACAATGCTATGGTGGGATCTGTAGTGCCTCGGCAAGAGGATGACCAATTTAGTGACTTACAAGTTCAGGTACTTTTTACACATCCAACACACACTGAAATGTTGCCATGCCCTTTTTTCTTGAACGGAGAATGCAAATTCAGTGATGAACAATGTAGGTACTCACATGGAAAACTGGTTCAACTATCCAGTTTAAAGGAAGCAATTGAACCAAACTACGAAGGCTTAAAAGTAGGCAGCAGGATATTAATGAAACTTAAGCCACCAGATGATGAGGATATGAGCACAGTTAAAAAGTCGACAGAAAAATATCATCTTTGGCAGAGAGCTATAGTAATTGGTGTAGATTTAGATAAGAGGACATGTGTCGCTAAGCTAGAACACGGTGGGAAGATTGGAGAGAAAAGAAAAGTTGTTTCAGATGAATTTCATTTGAAGTTTGAAGATTTTTTTCCTCTTGGATCTCAAGATGACAATGACACCGACTCGGACAGTATAAGTGACACAGAGTATCCAGAGTTGAAGAGTAGCAAAAGAAATGATGACCACACCAATTTGATAGGAGACAATTTACATATTAATGCTCCCGCAATGGGGGAGTGGGAACGTCATACTAGAGGCATGGGATCAAAGATTATGTTATCAATGGGTTACATCCCGGGCACAGGCTTGGGTGCAGCAAGTGATGGCCGTTTGTTACCGGTTGAGGCTCACATCATGCCCTCTACAGCCTCCCTTGATAGGTGTATGGAATTGAAACAGAAAGCTTCGGACAAAGATGCATTTTTGGTTGAAAAGAGACTTAAAAGATTACAAAAAAGAGAAGAAGAACGCAGTAAACGTGCCTATGAAAGGGAGAAAGAAATGGAGAGAAGGAATGTTTTTAATTTCCTAAACAATACATTAGGTGATAAAGTTGATGTGGATACAGAACAATCAAAAAGAGCACCAACGGTTGACGTTAAACAATCATCTAGCAAAGATTTGAACATAGAAAAATTCAAGTTGGATGAAGATAGCAAGCGGATTGAATGCGAGATAATTAAACTAAATGGTTCATTAGCCAGATATCCATCACAGAGTAATGGCTACAGAAGCATTAGCATACAAATAGCAGAGAAAAAGAGAGAACTGGATCTGCTGAGAAAGAAAGAGAAGGAAATAACAAAAGAACAAAACCAGAGGAAGGACAAGCAGAAGATGACGGTGTTTTAG

Protein sequence:

>DPOGS204412-PA
MEELASSINQYEDQLSVVRQALQATQDANERESLSVLQSELQELISLTKESMNVQQNDNNQCDNATESNGLDEEYALFMKEMDESDTSNVKDHKNTTEEDKNSDIEDELASLLGMKCSVYHTHTWGGQPTLHNAMVGSVVPRQEDDQFSDLQVQVLFTHPTHTEMLPCPFFLNGECKFSDEQCRYSHGKLVQLSSLKEAIEPNYEGLKVGSRILMKLKPPDDEDMSTVKKSTEKYHLWQRAIVIGVDLDKRTCVAKLEHGGKIGEKRKVVSDEFHLKFEDFFPLGSQDDNDTDSDSISDTEYPELKSSKRNDDHTNLIGDNLHINAPAMGEWERHTRGMGSKIMLSMGYIPGTGLGAASDGRLLPVEAHIMPSTASLDRCMELKQKASDKDAFLVEKRLKRLQKREEERSKRAYEREKEMERRNVFNFLNNTLGDKVDVDTEQSKRAPTVDVKQSSSKDLNIEKFKLDEDSKRIECEIIKLNGSLARYPSQSNGYRSISIQIAEKKRELDLLRKKEKEITKEQNQRKDKQKMTVF-