Monarch geneset OGS2.0

DPOGS213212
TranscriptDPOGS213212-TA1551 bp
ProteinDPOGS213212-PA516 aa
Genomic positionDPSCF300114 + 319157-324637
RNAseq coverage2615x (Rank: top 5%)
Annotation
HeliconiusHMEL0029910.066.10% 
BombyxBGIBMGA007406-TA3e-5786.07% 
DrosophilaUbi-p5E-PA7e-1346.15% 
EBI UniRef50UniRef50_UPI000203A40A3e-3739.71%UPI000203A40A related cluster n=1 Tax=unknown RepID=UPI000203A40A
NCBI RefSeqXP_002426715.12e-3547.44%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|1706719627e-3841.18%uncharacterized protein LOC100144290 [Xenopus (Silurana) tropicalis]
NCBI nr blastxgi|1706719622e-3641.18%uncharacterized protein LOC100144290 [Xenopus (Silurana) tropicalis]
Group
Gene OntologyGO:00082703e-21zinc ion binding
GO:00055154.5e-16protein binding
KEGG pathway 
InterPro domain[442-512] IPR0000583e-21Zinc finger, AN1-type
[33-100] IPR0006264.5e-16Ubiquitin
[37-57] IPR0199567.3e-08Ubiquitin subgroup
Orthology groupMCL20462 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213212-TA
ATGTCACAACATGGATGCCCTGAGAGACTGTCGAATAGATCAAAGGGCTTTCAGGAAGATGGGTCGAGTCAGCCCACGATGGAAGTTTTGGTGGAAACCCTCACAGGCACGGCGTTCGAAATGACCGTTTCTCCGTCTGACACTATATTTGCGATCAAATCGAAGATATACAGGGTAGAAGGTATCCCGGTATCGCAACAACACTTGCTGTACAACCTTCGCGAGTTGGACGACGGCCTGTCTCTCAGCGAACATGCCATCGGTGACGGCGCCCGCCTCCGTTTAGTGTTAGGGCTCCGCGGCGGCCCCGTCGCTACTAGACGACTCCCGCCACCGGAGCCCTGGAGGGACATCGAACGGTTCCTCCATCTACACAAGTCAGACGAGGATGACGCGGAGTGGAACGGTTCCGGGTCGGGATGTAAGGCGACCGTGTTGGTGTTCAGAGAAGGAGAGAGGGTGAACCTGGTGCCGGTCCGGGAGAACAGAGATGGCACATACTCGCCGCTCGACCACAATAAGTATTCATCATCTCTAAGTCACCTAGTGGAGACGGAGGCGGGCGAGGGCGGGGCGGGCGCGGGGGCCGGTGTGGCCGGGGGCGGAGCGCTACACGAGAACGCTGTCACTCTGGGGAAGATGCTGGAGTTGAGGAGACGCATGGAGAGACTGGCGCTGCACAGACGACCACCCGACAGGACGGGCGAGTCTCATGACATCCAGAAGACGAGGAGCGAGGAAACCTTGTCCGTGCCGAGCCTGCTGGACGAGGCCGCGGAATTCGCCGGCAGGAGCTACGCCGGCTTCGGGGACGAGGAGGGCTTCGCCCCCCTCTACGACGGCAGCTACACGCGCTACGACTACGACTGCGGGTTCACCGACCGCTACACGCTTCTGCCGCCCATCGGCGCCAGGACGGACTCCGACCAGTCCATCCAGGACTCCGTCGCCGAAGAACGTCTCAAGCTGTCGGACGCCCTCGCGGGGTCCATACTGGTCAAAGCGCGGGAGGGCCGCGAGGACGACGCCATCCTGGAGGAGTGCGTGGACGGCGAGCCGTGTCTGAGGAGGGACTCTCCGCCGCTCGCAGCCAACAACCTGGCGCCCGTGGCAGCAGAATACGGCGCGGTGTCAGTGGGCGCTCGTTGGCGGCGCTCGTCGTCCTCGCTGGGCGCTCGGGCCGCGCCCCTCGCTGACACACTGTTCTGTTCCTCCACATCCGACCTGGAACAGCTGCGACGCAACAGAATCCTGCCGGCGCTGAGTCGTCACCGGAACCGCGAGCACCTGCACCTGAGCGACGAGGGCCTGGACCTGACGCGGCCGGAGCCCGAGCCGGAACCGGATAAGAAGACGCGCGTGCGCTGCGGCTTCTGCAGGAAGAGGCTCAGCATCGCCACCGTCCACACGTGCCGCTGCGGGGCCTCGTTCTGCGCGCCGCACAGGTACGCGGAGGTCCACGGCTGCGCCTACGACTACAAGGACGAGGCCCGGGACCTCCTGCGCCGCGCCAACCCCCTCGTCGGCGCGCCCAAGCTACCCAAGATATAG

Protein sequence:

>DPOGS213212-PA
MSQHGCPERLSNRSKGFQEDGSSQPTMEVLVETLTGTAFEMTVSPSDTIFAIKSKIYRVEGIPVSQQHLLYNLRELDDGLSLSEHAIGDGARLRLVLGLRGGPVATRRLPPPEPWRDIERFLHLHKSDEDDAEWNGSGSGCKATVLVFREGERVNLVPVRENRDGTYSPLDHNKYSSSLSHLVETEAGEGGAGAGAGVAGGGALHENAVTLGKMLELRRRMERLALHRRPPDRTGESHDIQKTRSEETLSVPSLLDEAAEFAGRSYAGFGDEEGFAPLYDGSYTRYDYDCGFTDRYTLLPPIGARTDSDQSIQDSVAEERLKLSDALAGSILVKAREGREDDAILEECVDGEPCLRRDSPPLAANNLAPVAAEYGAVSVGARWRRSSSSLGARAAPLADTLFCSSTSDLEQLRRNRILPALSRHRNREHLHLSDEGLDLTRPEPEPEPDKKTRVRCGFCRKRLSIATVHTCRCGASFCAPHRYAEVHGCAYDYKDEARDLLRRANPLVGAPKLPKI-