Monarch geneset OGS2.0

DPOGS209660
TranscriptDPOGS209660-TA1881 bp
ProteinDPOGS209660-PA626 aa
Genomic positionDPSCF300544 + 16500-22981
RNAseq coverage11x (Rank: top 84%)
Annotation
HeliconiusHMEL0026150.072.53% 
BombyxBGIBMGA008177-TA2e-11251.92% 
DrosophilaCG6236-PB1e-12337.46% 
EBI UniRef50UniRef50_Q7QAP46e-12339.09%AGAP003596-PA n=6 Tax=Culicidae RepID=Q7QAP4_ANOGA
NCBI RefSeqXP_970966.11e-12536.74%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910821732e-12436.74%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|1700703872e-12338.28%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00081521e-07metabolic process
GO:00038241e-07catalytic activity
KEGG pathway 
InterPro domain[7-625] IPR0042453.1e-159Protein of unknown function DUF229
[165-457] IPR0178501e-07Alkaline-phosphatase-like, core domain
Orthology groupMCL15885 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209660-TA
ATGAGTCTCTCTTATGGTTTCCAAAGGGAGGAGAAGCTGGACATTACGAGACGGTATATGGATAGACAATTAGAACAATCCCTGAAACTGGACAAGGGTTCCGGATGTGAGATACCCAGGTTAGATCCATTCCCGAAAGAAGTCACGCAGTTTGATAAGGATATACCCAAGGTTATGTGTTTGGGAACGGATTGGGTTAAATGTTACCAAACCAAATGTAGGGTAGTTCCTAAAATATTAAATACCACCGATAACATAGTGTGTTCATATCAAGACATAATATATGAGAGTGACCAAAAATATACAATAGGACCTCCCGTAGAGGTCAGAGGGGATAACGAATATGTTCTCACTAAGAGCGATCACGTGAAGATTAAGTGTTCGGGAAAACATAGAGACAGTATACTCCCCTCCAAATGGATCGGCCACTCTCTTGGCTTGCGTTCTACTGGCTATGCAAAACTCTCCCCGCAGGGAAGAGGTGACTCCTTGAACGTTCTCATCCTGGGATTCGACTCCACCGCCAAAAACGGTTTCATACGAAGAATGCCGAAAAGCTATAAAGTTTTAAAGGAAATATTGGGAGCTACGATTTTGAATGGGTACAATATAGTAGGCGATGGCACGCCGGCTGCCTTATTCCCGATCCTGACGGGGAAGACAGAGCTGGAGCTGCCGGATGTGAGGAAGAAGATGAAGAACAACAGAACCTTGGACTCCATGCCCTTCATATTCTATAAGTTGAAAGATGAAGGTTACCGAACAGCATTCTTTGAAGACATGCCCTGGATAGGTACATTTCAGTACAGATTCAATGGTTTCAAAAAGCAACCTGCGGATCATTACTTGCTGGCGTTTTACATGGAGGAGTCGAACGGTGGCAAGAAGTGGTGGACGAGCAGCCAGAACAAATACTGCGTGGGAGACACGCCGCAGTATAGACTGATGTTGGATATTACGGATCAGTTTCTCCGTCTGGATGGAAAACGTTTTGCTTTCACGTTTATAGTTGACATATCCCATGATGATTTCAACATGATATCCATCGCTGACGATGATACCGCAGATTATCTTAGAAGGTTCCACGACCGCTATAGAGAGGACACCTTGTTGATTGTCATGGGGGACCATGGACCAAGGTACGCAAACGTTCGAGATACTCTTCAAGGGAAACTCGAAGAGAGATTACCGCTCATGGCGATCAGACTACCAGACAAACTGACGAAGACCAGAACAGAGGCGGAGAAGAATCTGAGGAACAACGCGGAAGTGTTGACGACACCTCACGACATATACGCCACGGTCTTAGATGTTCTGGACCTGACTCAGTTCACTAATCCCTACAAAGTTAAAGGAGCCGACCTAACCAGAGGACTTAGTCTTTTGGAACCGATACCAAAGAACAGGTCGTGTAGCGAGGCCGGTGTGGAGGCTCATTGGTGTTCCTGTCTGTCCTGGCAGAACGTCTCTGACGATGACGTCATGTTCAGTAGGACGGCCGCCGCGCTGGTCGACTTCATCAATCATCTCACTGAGGAGAGGCGGTCGGTTTGCGCGGTGCGCACGCTCAAGTCGGTGTCGTGGGTGATGCGAGCGCGGCCCAACAGCGGTGTACTGACCTTCGTCGAGGCTCGCGATCAAGACGGATATGTCGGCAAGTTTGGTAACAGAGTGAAACAGACCAGGGAAAACTACCAGCTCAAGATCGCAGTGGGACCCGGCCATGGTATATATGAGGCGTTAGTGACTTACGTCATTACTGAGGATAGATTTGAAATCAATACGAGAGAAATATCACGGACTAACGCTTACAACAACGAGCCGAGCTGCATCAGCGACACTCACCCGCACCTCAACATGTACTGCTACTGTCGTCACTAG

Protein sequence:

>DPOGS209660-PA
MSLSYGFQREEKLDITRRYMDRQLEQSLKLDKGSGCEIPRLDPFPKEVTQFDKDIPKVMCLGTDWVKCYQTKCRVVPKILNTTDNIVCSYQDIIYESDQKYTIGPPVEVRGDNEYVLTKSDHVKIKCSGKHRDSILPSKWIGHSLGLRSTGYAKLSPQGRGDSLNVLILGFDSTAKNGFIRRMPKSYKVLKEILGATILNGYNIVGDGTPAALFPILTGKTELELPDVRKKMKNNRTLDSMPFIFYKLKDEGYRTAFFEDMPWIGTFQYRFNGFKKQPADHYLLAFYMEESNGGKKWWTSSQNKYCVGDTPQYRLMLDITDQFLRLDGKRFAFTFIVDISHDDFNMISIADDDTADYLRRFHDRYREDTLLIVMGDHGPRYANVRDTLQGKLEERLPLMAIRLPDKLTKTRTEAEKNLRNNAEVLTTPHDIYATVLDVLDLTQFTNPYKVKGADLTRGLSLLEPIPKNRSCSEAGVEAHWCSCLSWQNVSDDDVMFSRTAAALVDFINHLTEERRSVCAVRTLKSVSWVMRARPNSGVLTFVEARDQDGYVGKFGNRVKQTRENYQLKIAVGPGHGIYEALVTYVITEDRFEINTREISRTNAYNNEPSCISDTHPHLNMYCYCRH-