Monarch geneset OGS2.0

DPOGS206136
TranscriptDPOGS206136-TA2211 bp
ProteinDPOGS206136-PA736 aa
Genomic positionDPSCF300028 + 1192547-1201112
RNAseq coverage283x (Rank: top 39%)
Annotation
HeliconiusHMEL0087832e-17748.27% 
BombyxBGIBMGA000727-TA2e-10851.08% 
DrosophilaCG32369-PC2e-5633.06% 
EBI UniRef50UniRef50_B0X0254e-6038.89%Putative uncharacterized protein n=1 Tax=Culex quinquefasciatus RepID=B0X025_CULQU
NCBI RefSeqXP_001862997.17e-6138.89%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1951266512e-5835.88%GI12194 [Drosophila mojavensis]
NCBI nr blastxgi|1951266511e-5835.54%GI12194 [Drosophila mojavensis]
Group
Gene OntologyGO:00065083.3e-21proteolysis
GO:00041763.3e-21ATP-dependent peptidase activity
GO:00055151.4e-06protein binding
GO:00082701.4e-06zinc ion binding
KEGG pathway 
InterPro domain[512-712] IPR0159473.1e-32PUA-like domain
[514-709] IPR0031113.3e-21Peptidase S16, lon N-terminal
[435-499] IPR0130831.1e-12Zinc finger, RING/FYVE/PHD-type
[440-477] IPR0018411.4e-06Zinc finger, RING-type
[440-477] IPR0189573.7e-06Zinc finger, C3HC4 RING-type
Orthology groupMCL12323 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206136-TA
ATGGATTCTAATGTGCATTCTGCTAGAACTAGAGGTCAACGTAGAAGAGTGTCATCAAGTCAAAGATTACGACCATATGGAATGACAGATCTTAATGTGTATATTGATACGAATGTTCACTATAATCTCGATATTGATGAAGGAAGTGTTGCCAATAGGACCGAGGAGCCTTGTAGGGTATTGATTAAAAGAAACAACCAAGATTTCAACCATGGATTGGACGAAAGTCAGCGTGGTGATGATAGGATGGGTTCACCGACTCCAAACGCCAGCATAACTGCTGGTCATCTCCAGATAATAGCCGAGGATGAGAATATCGTTCTGGTTGCGTCCCTGGCAAATACTGACACCGATTCTATCGGTACTTTATCACCGACCCTATTACAATTACATCCCACGCACATCCAATCGAACAGTCCAACAGAAAATGGCGAGCAGTCTCTTAATTCTGATCAAATTTGCACTGCACCAGAACAGGACGCTGCTATGAATCCGTTATTCGACGAGGCTATGTACCAGGAGCCATCGGAGACGGAAGTTAAGAAGGAAACCGTCGCTCAAACGGCACAAGTTAAAAGCAACGGTCAAACTCTCACAACGAACTTATTCCTGAATCCTCATTCGCTGGTGAGGGAATTGCCGATGCCTAGCGTGAACGCCCACAAGTTAGCCAACAACCTGATCAAACTGTCCAGATATCTGAAGTCGCCAGCTCCCAGCGATTTATGCTGTCTAAACTGTTGTCGCATCCCGGTTCTGCCCGTCACCGGCCAATGCGGCCACACTAGATGCATGAGGTGCATCGTCGTCAATGGGACATGCCCGTGCGGCGTGAACGCTCCTAAGACGCTATTCGTAAACACGGTCATTAGAGAGATAATTGAAAAAATGATAAAATACATAAGAAGCCCGAGAATACTAGATCCTGGGTCGCCTAGGAAAACCTGCGAGAAGAAATTCCCGCTTATTAGAGCGAGACGTCGTTATTATCGACGCGGACAGAGCTTCTCTAGAGGCGCTCCTCTCAGTAACTCTCCGACCTGGTGTTTCTCAAGACCCCGAGTGCCTTTGACCGTCCAGGCGCGGTTCAAACGTGCTCGGGCGCTACTGGCGGCTGGAGAGTATTTGCAGGCTGCACCTCACCTGGCCAGGGTCGCAGCCTCAACGGAGCCCTGCGCTAGGATGGCGAGATTGATGCTGGCACAGACTATAAGCGCTTTAAGCGAAGGTCACAAGCGGAAGAGCGTCTCCCGGGAGCTGTTCCAGTCTGTGAGGCAGCAGTCTACGATCAGCTGGTTAGCACCCTCGGATCTGGAGTGCGTGTTGTGCACGAACAGCTACACGAACCCGGTTTCGACTCCCTGCGGCCATACATACTGCAGGACCTGCATAGAAAGATCCTTGTACTATAAGAAAAAATGCGCGCTCTGTTTGGGACCATTGGAAAACTTTATGCTACCTGAGACTCAAGACACGTTGTTCATTAGTTCAATACTATCGTCTATCGGAGTGTCGCAGTCTGTTCGTGATGAGGACGTGATACCCGTCGTAACATGCTACGTTGCGTTCCCTGGAATGCCCTGCCCGCTGTTTATGTTCAACCCTCGCTACTGGCAGATGGTGAGACGAGTGTTGGAGTCAGGCACACGAAGATTCGGCATGCTGGCACACGAAGGTGGAAATAACTTTGCTGATTACGGCACAGTGCTCGAGATCTGCGACTGCGTAGTGCTGGAAGACAACCGCTGTATAGTATCGACGGTCGGCGTCTCCAGGTTTAGAGTCATCGAGAGACACATTAGAGACGGGTGTGACGTAGCCCGAATCCAGCCACTGACAGATGTGACACCAACTGAGGACGAGCTCCAAGACCTGCATACTCTGTCCTCGCAGATATCATCCAAAACTCAAACCTGGCTAAAGAATATGGACGAGGGTGTTAGGAAAGAAATCGAAACTGCCTTCGGAGCTATGCCTTGTAAGGACATTCCCGAAAACTGGTGGAACACATCCGATGGACCTAATTGGCTGTGGTGGCTGATAGCCATACTGCCCCTGAAGTCAGAGATCAAGATATTAATACTATCAACACGAAGTCTTCTCAAACGGATGTTGGCTGTATCAAGGACTTTGGACGTCATGGACGCAGAGTTTGTATCAAACGACTCAAAACTGAACATCACTAGCAGAAAGGAATGGCTGAGGAGATGA

Protein sequence:

>DPOGS206136-PA
MDSNVHSARTRGQRRRVSSSQRLRPYGMTDLNVYIDTNVHYNLDIDEGSVANRTEEPCRVLIKRNNQDFNHGLDESQRGDDRMGSPTPNASITAGHLQIIAEDENIVLVASLANTDTDSIGTLSPTLLQLHPTHIQSNSPTENGEQSLNSDQICTAPEQDAAMNPLFDEAMYQEPSETEVKKETVAQTAQVKSNGQTLTTNLFLNPHSLVRELPMPSVNAHKLANNLIKLSRYLKSPAPSDLCCLNCCRIPVLPVTGQCGHTRCMRCIVVNGTCPCGVNAPKTLFVNTVIREIIEKMIKYIRSPRILDPGSPRKTCEKKFPLIRARRRYYRRGQSFSRGAPLSNSPTWCFSRPRVPLTVQARFKRARALLAAGEYLQAAPHLARVAASTEPCARMARLMLAQTISALSEGHKRKSVSRELFQSVRQQSTISWLAPSDLECVLCTNSYTNPVSTPCGHTYCRTCIERSLYYKKKCALCLGPLENFMLPETQDTLFISSILSSIGVSQSVRDEDVIPVVTCYVAFPGMPCPLFMFNPRYWQMVRRVLESGTRRFGMLAHEGGNNFADYGTVLEICDCVVLEDNRCIVSTVGVSRFRVIERHIRDGCDVARIQPLTDVTPTEDELQDLHTLSSQISSKTQTWLKNMDEGVRKEIETAFGAMPCKDIPENWWNTSDGPNWLWWLIAILPLKSEIKILILSTRSLLKRMLAVSRTLDVMDAEFVSNDSKLNITSRKEWLRR-