Monarch geneset OGS2.0

DPOGS200537
Transcript: DPOGS200537-TA (1914 bp)
Protein: DPOGS200537-PA (637 aa)
Genomic position: DPSCF300119 - 298853-301970
RNAseq coverage: 287x (Rank: top 38%)
Annotation
Heliconius: HMEL008568, 0.0, 76.13%
Bombyx: BGIBMGA001985-TA, 2e-52, 31.26%
Drosophila: m-cup-PA, 3e-174, 47.77%
EBI UniRef50: UniRef50_E3WWG9, 0.0, 50.60%, Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WWG9_ANODA
NCBI RefSeq: XP_001606383.1, 0.0, 53.99%, PREDICTED: similar to sex-determining protein fem-1 [Nasonia vitripennis]
NCBI nr blastp: gi|375493326, 0.0, 55.38%, fem-1 homolog A-like protein [Locusta migratoria manilensis]
NCBI nr blastx: gi|375493326, 0.0, 55.38%, fem-1 homolog A-like protein [Locusta migratoria manilensis]
Group
KEGG pathway 
InterPro domain: [37-266] IPR020683, 7e-45, Ankyrin repeat-containing domain
Orthology group: MCL16547, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200537-TA
ATGCAAGAGTGCAAGCATAGTGCCCCAGGTGCACGCCTCTCTGCAAAGCTGCGGACAGCTTTAGAAAGGATGCCTCCGGAGGAGCGTCGTGCAGTCTGCAGTAGGACCCGCGAGGGTTGCGCGCCGTTGTTCGTGGCCTGTCGCCGAGGCAACGTGGAGCTGGTGGAGTATCTGGTGCACGTGTGTGCGGCCGAGTTGGAGCAGCGTGGTGTGTACGAGGTGCCACATGACCGCTCCACGCACTCCGTCACTCCGCTGTGGTGCGCCGCCGTGGCCGGTCGTCTCGAGGTGCTGCGTGCACTGGCCGACGCCGGCGCCGACCTGGATGCTGGGAGTGACAGCGGCTCGACCCCGGTACGCTCGGCGTGCTTCATGACGCACCTCGAGGTGGTGCGCTTCCTGGTGGAACGCGGCGCCGACATTCACCGCGCCAACCACAACGGCGGCACGTGCCTCATTAACTCGGTGCAGTCCGTCCCTCTGTGCGCCTTCCTGCTGGAGCGCGGCGCGCACGTGGACGCCCGCGACATGCAGCACAAAACGGCGCTCCACTACGCCATCCAGGAGCACCGCCTCGAGACCGCGCGCCTCCTACTGGACCGCGGTGCTTCCCCTCACCTCCGCTCCCGAGCCGGCGACGACGCGCTCCGTACGGCCTGCCTCAAGGGTGCGTCGCAGATCGCAGCCCTGCTGTTGTCTCGTGTCCGATACTCACCGGCTCGGACGGCCGACGCCTACGAGCTGTTGGGTGCTACCCAGCTGGACGAATTCAACGATGTGGCGGCCGCTCTTGGCTCCTGGCGGCGGGCGACCGCCGTCCGGCACGCTCACGGCGGCTACGTCGAGAAGACCGGCGATCCCTTAGCGGAGGACGCAGCGGGCGGAGCGGACGCGCTGGCGGCCGGCGCCCTGGGCGGGGCACGCGAGTGGCGCTCGGCCGAGGAGCTGGAGATACTTGCCACCGACGTAGACGCGCTTCGCACGCAGGCTCTGCTGGTTGCCGTCCGCGTGCTCGGGGTCGCTCACAAGGACACCGTGTTCCGTCTCATGTACCGAGGAGCGTCGTACGCTGACGCCTTCCGCTACCAGCGATGCATCGACCTTTGGAGCTGGGCGCTGCAGATTCGTATAGAGAAAGATTCTCTGCTGTATACGGACACGTGTCACACAGCGAGCGCCCTGACCCGTCTGTTGTTGGACGCGGGCGGCGGTCGCCTGGAGCGTGCGCGAGGTCTGCCGAGACACCAGGACGTGCTCCGCGTTTTTACTCTACTGGCGGACCACCTGCCAGAATGTCGTCGTGCGCTGGTCGCCCGCCCCGTCCACAAGAAGCAGGCGGAGACCTTCGATCGCGCGCTACGTTGCGTGTCTCACCTCCTCCATCTCCTGCTGCTGACCGCTCGCTCGGAGAGCGATCACGAAGAGGTTCGCGCCCGTGTCCGCCGGCTGGTGGCGGCCGACGTTCGCAGCGCCCACACGGGTGACACGCTCCTCCACCTGTGCGTGTCGCGCCTGAACGTAGTCCGCTCCACGTACTTCGCCGACGAGACGGCGGTCCCGCCGGTGTTCCCGAGCGTGAAGGTCGTGGCGCTGCTGCTGAGCTGCGGAGCGGACGCTCGCGTGCGCAACGAGGCGCGGTCAACGGCGCTGCACGTGGCCGCCATTCCGTACAACTTCTCCACCGTGCTGGTGGAGACGCTGCTGGCGGGCGGCGCGCACCTCGACCAACCCAACCGCTTCGGAGACTCGGCAGCGGAGCTGGTGTCCCTGAACCGCGGCTCCCGCGTCCGCGTGCTGCGTCACGTGTCCCTGGCCTGTCTCGCGGCGCGCGCTCTGCTCGCGTCCCGGCGGGATATCCCCCCGCACACCCTGCCGCGGACGCTCCATGCCTTCCTCGACCTGCACCGAGCCTGA

Protein sequence:

>DPOGS200537-PA
MQECKHSAPGARLSAKLRTALERMPPEERRAVCSRTREGCAPLFVACRRGNVELVEYLVHVCAAELEQRGVYEVPHDRSTHSVTPLWCAAVAGRLEVLRALADAGADLDAGSDSGSTPVRSACFMTHLEVVRFLVERGADIHRANHNGGTCLINSVQSVPLCAFLLERGAHVDARDMQHKTALHYAIQEHRLETARLLLDRGASPHLRSRAGDDALRTACLKGASQIAALLLSRVRYSPARTADAYELLGATQLDEFNDVAAALGSWRRATAVRHAHGGYVEKTGDPLAEDAAGGADALAAGALGGAREWRSAEELEILATDVDALRTQALLVAVRVLGVAHKDTVFRLMYRGASYADAFRYQRCIDLWSWALQIRIEKDSLLYTDTCHTASALTRLLLDAGGGRLERARGLPRHQDVLRVFTLLADHLPECRRALVARPVHKKQAETFDRALRCVSHLLHLLLLTARSESDHEEVRARVRRLVAADVRSAHTGDTLLHLCVSRLNVVRSTYFADETAVPPVFPSVKVVALLLSCGADARVRNEARSTALHVAAIPYNFSTVLVETLLAGGAHLDQPNRFGDSAAELVSLNRGSRVRVLRHVSLACLAARALLASRRDIPPHTLPRTLHAFLDLHRA-
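
The transcript and protein lengths in this record are consistent with a complete coding sequence: 637 codons plus one stop codon give 1914 bp. The following is a minimal Python sketch of how the nucleotide record could be checked against the protein record. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence marks the stop codon; neither is stated in the record itself. The function name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: this record uses the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" here, which is assumed to be
    the convention used in the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons and last four codons of DPOGS200537-TA, copied from the record.
first_codons = "ATGCAAGAGTGCAAGCATAGTGCC"
last_codons = "CACCGAGCCTGA"

print(translate(first_codons))  # start of the protein record: MQECKHSA
print(translate(last_codons))   # end of the protein record:   HRA-

# Length consistency: 637 residues plus one stop codon.
print((637 + 1) * 3)  # 1914
```

Passing the full DPOGS200537-TA sequence to `translate` and comparing the result with DPOGS200537-PA would extend the same check to the whole record.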