Monarch geneset OGS2.0

DPOGS206000
TranscriptDPOGS206000-TA1425 bp
ProteinDPOGS206000-PA474 aa
Genomic positionDPSCF300253 - 177730-182319
RNAseq coverage213x (Rank: top 46%)
Annotation
HeliconiusHMEL0146170.076.99% 
BombyxBGIBMGA012658-TA2e-17476.36% 
DrosophilaCG13567-PA7e-5755.84% 
EBI UniRef50UniRef50_Q9UKZ13e-8942.04%UPF0760 protein C2orf29 n=59 Tax=Coelomata RepID=CB029_HUMAN
NCBI RefSeqXP_624811.21e-9145.38%PREDICTED: similar to CG13567-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3323746741e-9444.42%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3323746741e-9044.19%unknown [Dendroctonus ponderosae]
Group
KEGG pathway 
InterPro domain[293-466] IPR0193121.1e-119Protein of unknown function DUF2363
Orthology groupMCL13108 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206000-TA
ATGTCTCAACAGCTAATAAATGAAAATAGCAAATATGTTTTAGACTTATTTTCAGAGCAGACTATCGACTCCCAAAGTCTAGAATCCATCTGTGCTCAAGTCCAGAAGAGATTCCCAAAGTCTGAACATTTTAATTTGTGTCTTCTTTTCTCCACTCTCATCGCCGGAGGTGATCTGACATTACCGGGTCAAAGAGTAGTGGCGCTGGCTCTTATACATGACTTCTATAAAGGAGACAATCCGTTCAGTTCATTATATTTACATCTCCTGGACGGCAAGCCGGGACTCATTGCGTTGGCTCCTCAGGAAAGGCTGTTCATAGGACAATTACATGGCTTTGTGCTTGGTAATATTAAAGATGTGTTCAAGAAGACGGCGAAGCAGGTCATGTTGACGGAGGTGTCCGCTAAGGAACTGGAGTTTGATTACTCATCTCTACAGGTGCTGATGACGGAGCGCTCGTCGGACATCAGTTCCATTGCGAAGGCGACTGCACCAGCGTTAGTTCCGCTCGGTGATGGGGCGCCTCCGAGCCGTATGTCTATGAAGGAGTTGTTGGAAGCGTTGATGAGTAACGAGTATCTGCCTTTGCACCGCACCCTGTCGCCGGCGGGGCCGGTCCCTCCCCCCGCCTTCATCATGGACCCCACTGAGATCGCCTTTGCCGGGGAGGCAGTGTGGAAGAACCTCGTGAACCGGGGGGCGTATATTCCCTTGTACGACACCGACATGGAAGGTTTAACAGGACTTCGCCCCGAAAAGCGTGTGACACCCACCACAGAAAGTGCACCCAAGGAAACCAAAGAGAAGTCAGCAGAGAAAACGGAAGAAGTGACAGAAGAGAAGAAAACCGAAGAGAATCCCGTCGAAGAAGCAAAGGAACTGACGGCCATCGCTCTGAAGACGGCCTTGAGTGTTTCTCAACAACAGAGACTGTTGGCGCTACTGGACGACACGCCGGACATCGTGTACGAAATAGGAGTCACGCCCAACCAGCTGCCGGATTTAGTGGAGAACAACCCCATGGTGGCGATATCGGTGCTGCTGAAGCTGATTCACTCCCAGCACATCACGGACTACTTCTCCGTGCTCGTCAACATGGAGATGTCTCTGCATTCAATGGAAGTTGTCAACAGGTTAACGACCTCAGTGGATCTCCCCGTGGAGTTCGTTCACCTCTACATCAGTAACTGCATCTCAACCTGTGAGACGATCAGGGACCGCTACATGCAGAACAGGCTGGTGCGACTGGTGTGCGTGTTCCTCCAATCACTCATAAGGAACAAGATCATTAATGTTAAGGAACTATTCATAGAGGTGGAAGCATTCTGCGTCGAGTTCAGCAGAATACGAGAAGCAGCGGCGTTGTTCAGACTCCTCAAGCAATTGGACTCTGGAGACGCTCACAAGGATGGAAAGGATTAG

Protein sequence:

>DPOGS206000-PA
MSQQLINENSKYVLDLFSEQTIDSQSLESICAQVQKRFPKSEHFNLCLLFSTLIAGGDLTLPGQRVVALALIHDFYKGDNPFSSLYLHLLDGKPGLIALAPQERLFIGQLHGFVLGNIKDVFKKTAKQVMLTEVSAKELEFDYSSLQVLMTERSSDISSIAKATAPALVPLGDGAPPSRMSMKELLEALMSNEYLPLHRTLSPAGPVPPPAFIMDPTEIAFAGEAVWKNLVNRGAYIPLYDTDMEGLTGLRPEKRVTPTTESAPKETKEKSAEKTEEVTEEKKTEENPVEEAKELTAIALKTALSVSQQQRLLALLDDTPDIVYEIGVTPNQLPDLVENNPMVAISVLLKLIHSQHITDYFSVLVNMEMSLHSMEVVNRLTTSVDLPVEFVHLYISNCISTCETIRDRYMQNRLVRLVCVFLQSLIRNKIINVKELFIEVEAFCVEFSRIREAAALFRLLKQLDSGDAHKDGKD-