Monarch geneset OGS2.0

DPOGS201266
Transcript         DPOGS201266-TA   1209 bp
Protein            DPOGS201266-PA   402 aa
Genomic position   DPSCF300037 + 845114-854197
RNAseq coverage    210x (Rank: top 46%)
Annotation
Heliconius      HMEL010586        0.0     80.99%
Bombyx          BGIBMGA008082-TA  0.0     75.19%
Drosophila      CG11162-PA        8e-11   27.64%
EBI UniRef50    UniRef50_E9G489   3e-99   49.04%  Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9G489_DAPPU
NCBI RefSeq     XP_001850582.1    6e-83   56.25%  conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp  gi|332373310      9e-107  47.81%  unknown [Dendroctonus ponderosae]
NCBI nr blastx  gi|332373310      6e-112  47.81%  unknown [Dendroctonus ponderosae]
Group
Gene Ontology    GO:0005506  9.6e-18  iron ion binding
                 GO:0006633  9.6e-18  fatty acid biosynthetic process
                 GO:0055114  9.6e-18  oxidation-reduction process
                 GO:0016491  9.6e-18  oxidoreductase activity
KEGG pathway     lma:LmjF23.1300  3e-17
                 K00227 (E1.14.21.6, SC5DL) maps-> Steroid biosynthesis
InterPro domain  [223-332] IPR006694  9.6e-18  Fatty acid hydroxylase
Orthology group  MCL22175  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201266-TA
ATGGCGTCAGTACGACAAAGATCTCATGGGAATGAGACAAATGGGATCGATGAAGCTGGAACCAATAAAGCTAAGACCGTCGAAGAAGAGAAACCAAAATATTGGGACCCGCTCGTAGATGGTGTTAAATGGATAGAAAGATATGCGGAACCTCTAGAGAAGTTCTTTGAGAGGCTCCCAGACTTTATAAGCACCTTCGTAGCAACCTTCGCAGTCTTCACATTCGGCGCCACTTTGAGAGGAGAATGGGTGGTTATATTAGTGACAGCATTAAAACAGATTTCCGGCCACACCCAAAGCGGAAAAAATATGACAACCGAGGACATCTACCAGCTCTTTACGTCTGAAAATTTAAAGATGGAGAACTTCGCTCTAATCTTCATAGTGTCTCACTGCGTGTCGTTGGGCATGTACTTCTCTATAGGAGGGTTCTTGCATTGGTATTTCTACATGAAAAGACGTCATCTAGCACACGAATGGAAGATTCAACCAACAAAATGGCTCTCCCCTGAGCTAGAACGTCATGAAATAATGGTTGGAACATTGTCTCTACTTATCTCAGGGTCGTTCTCCTCATTCCTAGCATGTTACATCTTTAATGGAAACCCATGTACTGTTTATTTCCAATTCGGTGAATATGGCTGGTTGTGGTTCATCCTCCAGTTTCCTGTTATGTTCATTTATTCTGATTACACAACCTATATACTTCACCGGCTGTACCACACACCCTGGCTGTACAAGCATTTCCACAAACTTCATCATAAATACAAACAGCCCACCGCCTTCTCCGTCACCGCCATCCATCCAGTTGAGATTATGCACGTTAAGCTCACGATGTGTTTGCCTCTGTTTACTATTCCCATACACTGGATGGCGTTTTATGGCGTGATTTTGTACAACTACTATCATGGTATCCTCGACCACTCCGGCATCAACTTTAAAGCTCAGTGGTGGCAGCCTTGGCAGCCGGATGCCGAGTTCCATGATCAGCACCACGAATTTTTCCATTGCAACTTTGGCTTCAACATGTCTCTTTGGGATAGGTTGCACGGTACCATGAGGAAAACTACTCGGGTTTACACAGAAGATACATATCACGGTGAAGCTCCTGAAATAGATTCCGAAGAAGCGAAGATGATCATGGAAAAGGATCCAGAAGTTAAAGAATATATAGAAAAAACAAAATCCCAACTCAATGAAACTAATTAG

Protein sequence:

>DPOGS201266-PA
MASVRQRSHGNETNGIDEAGTNKAKTVEEEKPKYWDPLVDGVKWIERYAEPLEKFFERLPDFISTFVATFAVFTFGATLRGEWVVILVTALKQISGHTQSGKNMTTEDIYQLFTSENLKMENFALIFIVSHCVSLGMYFSIGGFLHWYFYMKRRHLAHEWKIQPTKWLSPELERHEIMVGTLSLLISGSFSSFLACYIFNGNPCTVYFQFGEYGWLWFILQFPVMFIYSDYTTYILHRLYHTPWLYKHFHKLHHKYKQPTAFSVTAIHPVEIMHVKLTMCLPLFTIPIHWMAFYGVILYNYYHGILDHSGINFKAQWWQPWQPDAEFHDQHHEFFHCNFGFNMSLWDRLHGTMRKTTRVYTEDTYHGEAPEIDSEEAKMIMEKDPEVKEYIEKTKSQLNETN-
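
The transcript and protein lengths quoted in this record (1209 bp and 402 aa) can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database: the helper names `fasta_lengths` and `cds_matches_protein` are hypothetical. It assumes the transcript is a complete coding sequence ending in a single stop codon, and that a trailing "-" in the protein record stands for that stop, which the record itself does not state.

```python
def fasta_lengths(text):
    """Return {record_id: sequence_length} for FASTA-formatted text."""
    lengths = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Record ID is the first whitespace-delimited token after ">".
            current = line[1:].split()[0]
            lengths[current] = 0
        elif current is not None:
            lengths[current] += len(line)
    return lengths


def cds_matches_protein(cds_len, protein_len):
    """True if a CDS of cds_len bp encodes protein_len residues plus one stop codon."""
    return cds_len % 3 == 0 and cds_len // 3 == protein_len + 1


# Lengths quoted in the record: 1209 bp transcript, 402 aa protein.
print(cds_matches_protein(1209, 402))  # True: 1209 / 3 = 403 codons = 402 residues + stop
```

Passing the two FASTA records above to `fasta_lengths` gives the raw character counts. Under the assumption stated above, the protein count would include the trailing "-" and so come out one higher than the quoted residue count.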