Monarch geneset OGS2.0

DPOGS205133
TranscriptDPOGS205133-TA1626 bp
ProteinDPOGS205133-PA541 aa
Genomic positionDPSCF300246 - 128888-134787
RNAseq coverage26x (Rank: top 77%)
Annotation
HeliconiusHMEL0026151e-15448.84% 
BombyxBGIBMGA008177-TA1e-10153.27% 
DrosophilaCG6236-PB3e-9334.27% 
EBI UniRef50UniRef50_Q9VF915e-9134.27%CG6236, isoform A n=9 Tax=Sophophora RepID=Q9VF91_DROME
NCBI RefSeqXP_970966.12e-9936.77%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910821734e-9836.77%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|910821738e-9736.77%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00081522.5e-07metabolic process
GO:00038242.5e-07catalytic activity
KEGG pathway 
InterPro domain[15-538] IPR0042451.3e-122Protein of unknown function DUF229
[115-396] IPR0178502.5e-07Alkaline-phosphatase-like, core domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205133-TA
ATGAAATACGACGAGCAAATGCCAAAAGTTGTTTGCTATGGCCAAGACTGGGTTGTATGCAACATGTCTGAATGTAGAGTACATGAAGAGATACTGCAAAGCTTGAAGAAGGTTGTATGTCACTATAAAGATATAATATATGTCAATGATAATAAATACGAAATTGCTGATCCAGTGACACGTCACGGCGACGAGGTTTACAAACTGAATAGAAGCGATGGGTTTAAAGTGTCCTGTAGTGGAATTTCAAAGAAATGGTACGAAATATTCCCAAAGAGATGGTTTGGCTACAAAGCAGGCATCAGACAAATAAAAGCAACACAACCTGACGAAAATACAATAAACGTTCTAATCGTGGGTTTCGATTCAACTTCCCATAACGGTATTATTAGAAAGATGCCTAAAAGTTATAAATTTTTGAAGACCGAAATGGAAGCAGTTATACTTAACGGTTACAATATAGTCGGAGACGGTACTCCTGACGCATTGTTCCCAATACTCTCAGGGATGAATGAGTTACAACATCCTCCAGCTAAAAGACGATACTCAAATAACATCTTCCTCGATACTGAACCATTCATATTCCACACGGCAAAACTTAATGGTTATGAAACAGCCTATTTCGAGGATATGCCGTGGGTCGGTACATTTCAGTACAGATTCAATGGATTCCTCCGGCAACCAGCAGATCACTATCTTCGATCCTTCTTAACAGAGGAAACCAAGAGTGGACAGAAGTGGTATTCTGGAATTCAGAATAGATACTGCTTGGGAGAGAAACCACAATACTTATTCCTACTGGATCTTACGAGACAATTTATGAAATTAAATTCAAAGCGTTTCTGCTTTACATTTATGGGCGACATAAGCCACGACGATTTCAATCAGATATCGACAGTCGAAGATGACTTTGTGGACTTTCTGAAACACGTGAAAACAAATCTATTGGAAGATACCTTAGTCATTGTCATGGGTGATCACGGACAAAGATTTGGTCCAATACGCGCGACATTCGAAGGAAAAGTTGAAGAAAGGATGGCGTTTATGTCGATAATACTACCAGAGAAACTGAAAAGGGAACGCAAAGATGCGTTAAATTATTTAAAACAGAACGCAGATGTTCTAACGACGCCATTCGACATACACACGACTATATTAGACGTTATTGGCTTGAAACAACACGCCAGTGATTACGCAGTACCAAACTCTAATATGAAGAGAGGGTTGAGTCTATTGGAGCCGATCTCAGTGTTGCGCACGTGTGCCGATGCTGATATACTGCCTTACTGGTGTGTGTGTATGAATGGTGATTGGAAGACAGTTACCAAAACCGATCCCAAGTTCACGGAGGCTGGGGTCGCTCTCCTCTCATACGTGAACAGAGCTACCAATGACCTAAGAGAACTATGCGCTGAAAGAAAATTGAAATTAATCAGCTGGGTGTTAATCAATGAGAACAAAGACTCGCGAGCGGGACAGAAGATTACGAATTACCAGGCATTGATTATAACAAGTCCAGGGCACGGCATCTTCGAAGGTATGATGGAGTATGATAATGAGAAGGACATGTTTCTTATAAAGTCTGATAAAGATGTCTCCAGAATATCCGCGTATGTGGCTTTATGA

Protein sequence:

>DPOGS205133-PA
MKYDEQMPKVVCYGQDWVVCNMSECRVHEEILQSLKKVVCHYKDIIYVNDNKYEIADPVTRHGDEVYKLNRSDGFKVSCSGISKKWYEIFPKRWFGYKAGIRQIKATQPDENTINVLIVGFDSTSHNGIIRKMPKSYKFLKTEMEAVILNGYNIVGDGTPDALFPILSGMNELQHPPAKRRYSNNIFLDTEPFIFHTAKLNGYETAYFEDMPWVGTFQYRFNGFLRQPADHYLRSFLTEETKSGQKWYSGIQNRYCLGEKPQYLFLLDLTRQFMKLNSKRFCFTFMGDISHDDFNQISTVEDDFVDFLKHVKTNLLEDTLVIVMGDHGQRFGPIRATFEGKVEERMAFMSIILPEKLKRERKDALNYLKQNADVLTTPFDIHTTILDVIGLKQHASDYAVPNSNMKRGLSLLEPISVLRTCADADILPYWCVCMNGDWKTVTKTDPKFTEAGVALLSYVNRATNDLRELCAERKLKLISWVLINENKDSRAGQKITNYQALIITSPGHGIFEGMMEYDNEKDMFLIKSDKDVSRISAYVAL-