Monarch geneset OGS2.0

DPOGS211125
TranscriptDPOGS211125-TA1989 bp
ProteinDPOGS211125-PA662 aa
Genomic positionDPSCF300007 - 394096-397054
RNAseq coverage220x (Rank: top 45%)
Annotation
HeliconiusHMEL0172400.080.95% 
BombyxBGIBMGA003001-TA0.071.43% 
DrosophilaCG9246-PA3e-17550.31% 
EBI UniRef50UniRef50_D6WYX10.054.07%Putative uncharacterized protein n=16 Tax=root RepID=D6WYX1_TRICA
NCBI RefSeqXP_968623.10.054.07%PREDICTED: similar to CG9246 CG9246-PA [Tribolium castaneum]
NCBI nr blastpgi|910894650.054.07%PREDICTED: similar to CG9246 CG9246-PA [Tribolium castaneum]
NCBI nr blastxgi|910894650.053.47%PREDICTED: similar to CG9246 CG9246-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[14-624] IPR0053434.5e-226Uncharacterised protein family UPF0120
Orthology groupMCL15332 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211125-TA
ATGAAGCTTAAGAATAAAAAACCAGCTATGCCGGTAGCTGAACATTCAGATTCCGAAGAAGAATTGGACCCTGAGGCACATAAGAAATCGTTACAAAAATTAAAAAAGATCGATCCAGATTTTTATAATTTCTTAGAAGAAAACGATGAAAATCTTTTGAACTTCGAGGCAGATTCTGACGAAAATTCTGATAAAGAAAATGACGATGAAGACTCAGTGCATGTTCCTGGACCTATTACGGGTGACAGTGACGAAAGTGATTTTGAGGATGATAATGCCAAATCTGTCAGTGGGAGAGTCACACTTAAAATGGTGTCTCAGTGGCAGACGGAACTACAAAGTGAAGGTAAAATGAAAATAACAGTTCTCAGCCAAGTTATAAAGGCATTTAATGCGGCCATGCTGAGAGCCACAAGTGAAGACGGCGAATCAAAAGGGGAATTAAAAGTTGAAGGTTCGTCGGTGTTCAATGCCGTAATACAAATGTGTGTATTATATTTACCCGGCGCCATAAAGAGATACTTGGGTATGGAACAGAGCGGCAAGGATCCCCAGAAATGTAAACATTTCGTGAAGTTGAAAGGTCCCCTCGTTGCATATTTGAAGGACCTATTAAAATTACTAAGCAGCGTTTCATCTGAGAATATTCTCACCGTTCTTCTAAAACATTTGCACCAGCTGGCAGTGTATGTTGCCTGTTTCAATAGCATCTCCAAACAGGCTCTTAAGAAATTAATACCTTTGTGGAGCGGTAGCGAGGAAACTGTCAGGGTGTTATCTTTTTTGTGCATATTGAGAATAACAAGGAATCAACAGTCAGCCCTTCTGGATCTCGTTCTGAAGGCCATGTATATGACATATGTGAAAAACTGCAAATTCGTTAGCCCTACCACCTGGCCAGGGATAAACTTTATGAGACGATCCCTCGTGGAGATGTTCTCCTTAGACCTTAATGTTTCCTACCAACACGTATTTCTCTACATCCGACAGCTCGCAATACATTTACGAAACGCAATTGTTGTTCAGAAAATTGAAAACAGACAAGCCGTATACAATTGGCAGTTTGTGAATTCCCTCCATCTGTGGGCCGATCTAATATCGGCCACTTCAAATAAGCCACAACTGCAGCCACTGCTTTACCCTTTGGTTATGGTGATAACGAATACCATAAAACTGGTGCCAACACACCAATATTATCCATTAAGATTCCATTGCGTTGAAATATTAATTAGTCTATCAAAGGAAACCGATACCTTTATACCAATTCTTCCATTCATTGTTGAGATTTTAACAGCTTATGATTTTAATAAGAAAAATAAAAAAATGTCAATGAAGCCATTAGATTTCTCTTGTATATTAAGATTGGCAAAATCTCAGCTTATGGAGAACGGTTTCAAAGACTCAGTCATCGATAGACTGTATGCACTGTTACTGGAATACACGGCCAGTATATCAAATAGCATTGCTTTTCCAGATATCAGTCTACTCGCAATAATACAGATGAAGCAGTTTTTGAAAACATGCACCGTTGCGAATTATACGAAAAAGATAAGACAGCTTCTGGAAAAGATAGAAGAGAACTCTAGGTTCATTGAGCGTGAAAGAGGTCAGATAACTTTCGGTTTGAACGAGATAAAAATGGTAGCGGCTTGGGAATCCAGGATAAAAGCTAAAGGCACACCCCTGATGGCGTTCTATGAGAGTTGGAACAAGGTTAATAGAATACAGAAACGAAAGAAGATCACCAACAATGATGAAATTGCTGGGGGATTACCGATGATCAAGAGACCTAAGGTTCCTGAAACAGAAGCAAAAATAACTAAGCCTGAAAACGAGGGGCCAATGGTGCTATTTCCGTCTGATAGTGAAGATGGAAACGTTGACTTCAAAGTTGATGGTGAAGATGAGAGTTTAAACAAACCGAAGAAGAAAAAGCTCAGGAAGAAGAAAATAAAGCACAGAAGCAAGAGAAGAATCTGCCGGAAGTAG

Protein sequence:

>DPOGS211125-PA
MKLKNKKPAMPVAEHSDSEEELDPEAHKKSLQKLKKIDPDFYNFLEENDENLLNFEADSDENSDKENDDEDSVHVPGPITGDSDESDFEDDNAKSVSGRVTLKMVSQWQTELQSEGKMKITVLSQVIKAFNAAMLRATSEDGESKGELKVEGSSVFNAVIQMCVLYLPGAIKRYLGMEQSGKDPQKCKHFVKLKGPLVAYLKDLLKLLSSVSSENILTVLLKHLHQLAVYVACFNSISKQALKKLIPLWSGSEETVRVLSFLCILRITRNQQSALLDLVLKAMYMTYVKNCKFVSPTTWPGINFMRRSLVEMFSLDLNVSYQHVFLYIRQLAIHLRNAIVVQKIENRQAVYNWQFVNSLHLWADLISATSNKPQLQPLLYPLVMVITNTIKLVPTHQYYPLRFHCVEILISLSKETDTFIPILPFIVEILTAYDFNKKNKKMSMKPLDFSCILRLAKSQLMENGFKDSVIDRLYALLLEYTASISNSIAFPDISLLAIIQMKQFLKTCTVANYTKKIRQLLEKIEENSRFIERERGQITFGLNEIKMVAAWESRIKAKGTPLMAFYESWNKVNRIQKRKKITNNDEIAGGLPMIKRPKVPETEAKITKPENEGPMVLFPSDSEDGNVDFKVDGEDESLNKPKKKKLRKKKIKHRSKRRICRK-