Monarch geneset OGS2.0

DPOGS206912
Transcript        DPOGS206912-TA   1806 bp
Protein           DPOGS206912-PA   601 aa
Genomic position  DPSCF300001 - 1516434-1521153
RNAseq coverage   212x (Rank: top 46%)
Annotation
Heliconius       HMEL016094        0.0     75.82%
Bombyx           BGIBMGA012869-TA  0.0     71.43%
Drosophila       CG32112-PB        2e-130  71.38%
EBI UniRef50     UniRef50_Q8IQI5   3e-128  71.38%  CG32112, isoform B n=19 Tax=Neoptera RepID=Q8IQI5_DROME
NCBI RefSeq      XP_001843583.1    4e-164  53.60%  conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp   gi|347969788      3e-173  53.36%  AGAP003371-PA [Anopheles gambiae str. PEST]
NCBI nr blastx   gi|347969788      5e-169  53.28%  AGAP003371-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain  [16-589]  IPR019149  2.9e-158  Protein of unknown function DUF2048
Orthology group  MCL14147  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206912-TA
ATGTCCGCTAGCAAATTAGATGCTGTGTACCGGAGTATACTATTAACAAAATTTTTTACTAAAGGATGGGGTAAACCAGAGAATTTGCGAAGGTTATTCGAATTCAGAAAAGTTGTATCAAATCGAGATGAATGCTTTAAATTAGTTGAAAGAGATTATCCTGTAACAATAACGAAAGAACAAAATCTGACAGACTGTCGGTTATTGGAGGGATATTTTTTAACACCATTGGAAAGGTATCTACCAGGCATTGTTCCAGAAATTGCTCAGAAAGCTCACTTTCAAATATTACTACCAGTTCACTGGCCGGATCCACGGTGCAAGCCTGTGTGTTTACATCTAGCTGGAACAGGCGATCATTTTTTTTGGCGACGGAGAAATCTTATGGTAAAACCGTTACTAAAAGAGGCCGGTGTTGGAGGTATCATACTTGAAAATCCTTTTTATGGACTACGAAAACCTACTGATCAAGTACGATCCTCGCTTCACAATGTGTCGGACATCTTCGTGATGGGCGGCTGTCTGATCCTGGAATCACTGGTTTTATTTCATTGGTGCGAGCGGAACGGCCTCGGCCCTCTGGGTGTCACAGGGTTGTCGATGGGTGGCCATATGGCTTCCTTAGCAGCAACTAATTGGCCTAAACCACTCGTCTTAGTTCCATGTCTTTCGTGGTCAACAGCTTCTGCGGTCTTCTTGCAGGGCGTAATGTCCCATTCCATTAACTGGGACCTCTTAGAAGACCAGTATATGTCTGATGGCGTCTACAGAGAGAAATTATCGAAAATGGTTACCATAGTGGACGAGGCTTTTCTTGCCGGCAAAAAATTTGCTCGAACTTACACACAGACCATGGAAAACACTGCCGCTATGAAACCGCAGACAAATACAGATGATATTATAGATTCACTGAAATTTAACGTCGGAGCTAAGAAGATGGACATTACTTATAAGGAGAACGCCAAAACAAATGCTCAAATCGTTGACAATAAACAAACTGTAGATAATAAAATCGATTTACCACATAGCGTTAAGAGTGTCGTAGACGTTGAAGACGAGATAATAAAGGAAGAGCTGAGAAAACTTCTGTGTGATAATAAAATAAGTCAAATGTTATATAACAAATTAACATCCAATAAACATTTCGAATTGGAGCCCCAAGACATTGAGTTTATAAACAAACTGGACGATGGGAAACTTAAAGTGTTCCTACTTAAATATCAGAGTGTTTCAGACAACATTTCGAACGAAAAATTACCATTGGATGGCAACAATAACAAAACAGTCAAAATAACTCCGGCGATAGAAAACTCTAATTCAGTAAATCAGAACAAAGAATCCTCTAATGAGAACAAAGCTTTGGTAACTCCTGATGAGAAGATTTCGACGTCTTTAATTAAAAAAGAACGCAAATCCTGGAACGTGTCAGAACTCACTTCGGACCTGTGGGTGAACTTGCCATTCATGAAATCAGGAAAGAAGATAGATATAGGTAAAATCCACTGGCGGGACAGAGAGGCACTGCAGTTTATGCGTGGAATTATGGACGAATGCACACATTTAAGCAACTTTTCTGTGCCATTCGATACATCCCTTATTATAGCGGTCTGCGCGAAACACGACGCCTATGTACCGCGAGAGGACGTCGGTACTTTGGAGGAGATCTGGCCCGGAGCTGAGGTGCGCTACGTCGACGCCGGACACGTGTCCGCTTACATTCTGCACCAATCGCTCTTTAGGTCCTGCATCAAAGAGGCCTTCGAGAGGTCTAAACTAAGGTGGCGAGACGGGAAACACGTCGATTGA

Protein sequence:

>DPOGS206912-PA
MSASKLDAVYRSILLTKFFTKGWGKPENLRRLFEFRKVVSNRDECFKLVERDYPVTITKEQNLTDCRLLEGYFLTPLERYLPGIVPEIAQKAHFQILLPVHWPDPRCKPVCLHLAGTGDHFFWRRRNLMVKPLLKEAGVGGIILENPFYGLRKPTDQVRSSLHNVSDIFVMGGCLILESLVLFHWCERNGLGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFLQGVMSHSINWDLLEDQYMSDGVYREKLSKMVTIVDEAFLAGKKFARTYTQTMENTAAMKPQTNTDDIIDSLKFNVGAKKMDITYKENAKTNAQIVDNKQTVDNKIDLPHSVKSVVDVEDEIIKEELRKLLCDNKISQMLYNKLTSNKHFELEPQDIEFINKLDDGKLKVFLLKYQSVSDNISNEKLPLDGNNNKTVKITPAIENSNSVNQNKESSNENKALVTPDEKISTSLIKKERKSWNVSELTSDLWVNLPFMKSGKKIDIGKIHWRDREALQFMRGIMDECTHLSNFSVPFDTSLIIAVCAKHDAYVPREDVGTLEEIWPGAEVRYVDAGHVSAYILHQSLFRSCIKEAFERSKLRWRDGKHVD-
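
Consistency check (illustrative sketch, not part of the original record): the transcript and protein lengths listed above should agree, since a 601 aa protein plus one stop codon needs 3 x 602 = 1806 bp. The Python below is a minimal way to check that, and to translate a coding sequence, under the assumption that the standard genetic code applies. The names `translate` and `lengths_consistent` are hypothetical helpers introduced here. The sketch writes the stop codon as `*`, whereas the record marks it with a trailing `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any incomplete trailing codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript holds exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First five and last three codons of DPOGS206912-TA, copied from the record above.
print(translate("ATGTCCGCTAGCAAA"))   # start of the coding sequence
print(translate("GTCGATTGA"))         # end of the coding sequence, including the stop codon
print(lengths_consistent(1806, 601))  # lengths listed for the transcript and protein
```

Passing the full nucleotide sequence to `translate` and comparing the result (with the final `*` replaced by `-`) against the protein sequence would extend the same check to the whole record.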