Monarch geneset OGS2.0

DPOGS213800
Transcript         DPOGS213800-TA  3102 bp
Protein            DPOGS213800-PA  1033 aa
Genomic position   DPSCF300106 - 97352-110422
RNAseq coverage    75x (Rank: top 65%)
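The transcript and protein lengths above can be cross-checked with simple arithmetic. The sketch below assumes the listed transcript is the coding sequence including its stop codon, and that the genomic coordinates are 1-based and inclusive; neither convention is stated in the record, and the function names are illustrative only.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein of protein_aa residues."""
    # Three bases per codon, plus one extra codon if the stop is counted.
    return 3 * (protein_aa + (1 if include_stop else 0))


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record for DPOGS213800.
cds_bp = expected_cds_length(1033)      # protein DPOGS213800-PA is 1033 aa
span_bp = genomic_span(97352, 110422)   # position on scaffold DPSCF300106

print(cds_bp, span_bp)
```

Under these assumptions the coding length equals the 3102 bp reported for the transcript, while the genomic span is considerably longer, which would be consistent with a gene model containing introns.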
Annotation
Heliconius       HMEL022119        0.0     67.81%
Bombyx           BGIBMGA006785-TA  3e-174  70.50%
Drosophila       Cyp6a13-PA        4e-71   35.73%
EBI UniRef50     UniRef50_Q1KHF5   5e-140  67.61%  Cytochrome P450 (Fragment) n=3 Tax=Bombyx RepID=Q1KHF5_BOMMO
NCBI RefSeq      XP_001944599.1    2e-85   39.68%  PREDICTED: similar to cytochrome P450 CYP6AY1 protein [Acyrthosiphon pisum]
NCBI nr blastp   gi|305671408      6e-171  70.26%  cyp6u1 [Bombyx mori]
NCBI nr blastx   gi|305671408      5e-168  70.26%  cyp6u1 [Bombyx mori]
Group
Gene Ontology    GO:0009055  1.2e-91  electron carrier activity
                 GO:0020037  1.2e-91  heme binding
                 GO:0016705  1.2e-91  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
                 GO:0005506  1.2e-91  iron ion binding
                 GO:0055114  1.2e-91  oxidation-reduction process
KEGG pathway     nvi:100114023  6e-78
                 K07424 (CYP3A) maps->
                     Drug metabolism - cytochrome P450
                     Drug metabolism - other enzymes
                     Linoleic acid metabolism
                     Steroid hormone biosynthesis
                     Metabolism of xenobiotics by cytochrome P450
                     gamma-Hexachlorocyclohexane degradation
                     Retinol metabolism
InterPro domain  [12-412]   IPR001128  1.2e-91  Cytochrome P450
                 [231-248]  IPR002401  4.8e-13  Cytochrome P450, E-class, group I
                 [833-884]  IPR005062  4e-06    SAC3/GANP/Nin1/mts3/eIF-3 p25
Orthology group  MCL22141  Insect specific

Nucleotide sequence:

>DPOGS213800-TA
ATGCTACAATTCTTCTACGACAAATACAGAGATGAGAGATATGTTGGAATTTTTCAAGCTAGGAGACCGGCTTTGATGCTGATAGATTTAGAATTGATAAAATTTGTATTGTCCAAAGATTTCCAACATTTCACTGATCGTATATCTGTTTCGACGGATACGCAACGGGAACCGCTTTTGAGGAATCTAGCAAATATGAGCGGCACGGAATGGCAGAAAATGAGGCACATAGTTACACCGACATTCTCCTCGGCTAAAATGAAAGCCATGTTCCCTTTAGTCGTCGATTGTGCAAGAACATTACAAACAACTCTAGAAAACGAGTCCATTGAGGATATCGAGGTGCCGAAGTTAATGTGCCGGTTTACAACTGACGTTTTAGGAAGCTGCGCCTTCGGCGTCGATCCTGGATCATTAAAAGATAAAATGTCACCATTTTTTATTATGTCGCAGAAAATGTTCAAAACCGATCGCAGCACTATATTAAAGAGATATTGTCGTTCTTTTTCGCCGCGACTGTTTAAATTTTTAAACCTAAGAACGTACTCGCTGGATGTGGAAGTATTTTTTACTAATATCATCAATCAAGTATTAAACGAGAGGCGAACGACCGGCAAGCAGCGAAGTGATTTCCTACAGCTTATGTTGAATGTCCAAAAAACTGAGATCGGTTTTACGATGACGGATGAATTAATTATATCAAATTCATTTATATTCATGCTCGCCGGTCTAGAGACATCAGCTACAACGCTATCGTTCTGTTTGTATCAGCTCGCCAAAGATATAGATTTACAGAACAGATTAAGAGATGAAGTTAGAGAGTGTATAGAAAATCACGGCGGTTTTAATTACGACGCGATAGGTGCGATGCGTTTGGCGACTCAGACATATTTAGAGACGCTGAGATTACATCCTCCGACGCCTCTTACGACGAGGCTGTGCACATCACCATGCACATTACCTGGTACAGGTCTCAATATGAAAGTCAGAGACGCAGTTCTAGTGCCGATACACCAGATCCACAAAGATCCGAGGCATTTTCCCGATCCGGAGAAATTTGATCCTGAACGTTTCGGTGGTGCCATGAATGTTAACGGTTTTATTGCATTCGGTGACGGACCCAGGAGTTGTCCAGGAGGTCGTTTCGCTCAGATGATGGTGGTAGCTGGTTTGGCTACGATTCTTCAGAATTTCTCAGTGGAGCCATGCTCTAAAACAACACCAACTATACAATACGAAACACGAAGCGTTACAGACATCGAGGTACTCCAAGCCATCAACCAAGGAGACTTCATAGAAGACTGTCTCAATATATTCGTAGAGTTCTCCAACAATAAGCATTACGAACACAATATAGAACCGAATATCCCACTATCCGTCCTAGACCGTGGTTCAGAACTAGAAGTACGTGCACAGAAGAGCGAGTTCAATTTGTCTCCAACATTAGCGTACACACGGCCACCGCCGACAGCCAACCTGGTGGCGGCAGTCGCTGACGTCATCAATAATATCACATTATACAAGAGAGCCCCTGTGCAGCTTTGGAATGAAAAGGGATTGTACAGAGTTTTGTTTAGGTGTATATCTCAGCCTAACGTCTCACAGCGCTCCTTCGCACACACGGCCGTCTGTCGAGCGCTCGCTGCGTCCTCTACACATAAATGCGTGAGGGTCGCGCTCGCCAATACAAAAGACTGTGTGTATCACTTGCTACTAACACTCACTCCTATTGAGTCAGATCCTAGTTGGGTTCTCATAGCGTCTTGCCTCAGTTCTGTGCTATGTTCGAGTGTTCGCGCCCGTTCCTTCGTGGTCCATCGACAGCTTTTCCGCGATATATCTGGTGTACTTCACACCATGAGAGATCACCTCACACTGATGGGGAAACCCATAGACGTTATACGGAACGCTAACCATGAGCCGACCTTGAACACACTGAACTGGGTTCTCATCCTAGCGAGCAGCATGATGGTGGATAATCCCCCAGCCAAGGACCGTCT
CTCGGAAGACATAGCCGCCTCCCTGACGCGGCTGTGGCCCTGGTGTATGATGACGGAGGAGTTACGAAACAGTGTCATGCAATTTCTTTTGATATTCACCAATGATTGCCCTAAAGACAACTCTTCTGAGGAATTCACTGCAGCTCAAATAATGACTATCATGTCGATGTCACAAAGTCATATTACGGAGGAAGTACTTGATTTTGAGCAGTATGAAAATATAGCACCCGGCCGAGTTTCATACTTAGACAGGACTGACTCAAGTTTTGGATTAGAACTTCTACACTTTTATATTTTTCTCCAGTTAATTATGGACAGTAGGAACTTAAACAAGGACGGTGAAACGTGTATTCACGGAACATGTTTAGATATGTGCCCACCACAGGAGATGAACTTGAGGAAAAGGGAGAAATTGGTTCATAAATTAGAAGTTACAACGGAAGGTTACAAATTAGTTAAATGCTATAGTCGCTCAGCGGCAGATTCGAACATGGCTGTACCCAGCCAACTTCGACCCTTCCCCACACTTATGACAACAACACAATATTTGTTATTAAATGTTTCAAAAAGGAAAGATGTCAAAATGTCAGTCATATACAATTTCTTGGATGACCGTCTCAGATCTGTGAGGCAGGATATGACGATACAGAGTGACGTGTTATCCGTGTGGTGCAAGCTCCGCGGGAGGTGTCCGTCCTCGCTGTCGTGCTGGTGGTCGCTGTGTGCGAATGTTTGTCGTCACCCGGACGGAGCGGCCGCGGTGTTAGGTTCGATACAGGCGAAGCCAGCGCCAGCCGCGTTACTGCCGGCCCTCGCTAACGCTGCACATCACTGCAGACATGCGTTCTTGCAGTCGTCGGACCTATTGGAGTTGTTGTCTAGCTGCCTGCTGACAGGAGACACGGCCGAGATTGTGTCGTCAGCACGAGCGGTGTGGGCGCTGGCCGCTAACAATCATAGGGCTAAGCTGGTACTCCGTAGTGCAGGGCTCCAAACAGCAGTACAAACTACTTTGCAGCGTTTGCAAAAAAACAAAGATGTCGCCACTCAACGAGCTGTCGAGTTACTGACCTACACCAACACCGTACTTCAAGCTATATGA

Protein sequence:

>DPOGS213800-PA
MLQFFYDKYRDERYVGIFQARRPALMLIDLELIKFVLSKDFQHFTDRISVSTDTQREPLLRNLANMSGTEWQKMRHIVTPTFSSAKMKAMFPLVVDCARTLQTTLENESIEDIEVPKLMCRFTTDVLGSCAFGVDPGSLKDKMSPFFIMSQKMFKTDRSTILKRYCRSFSPRLFKFLNLRTYSLDVEVFFTNIINQVLNERRTTGKQRSDFLQLMLNVQKTEIGFTMTDELIISNSFIFMLAGLETSATTLSFCLYQLAKDIDLQNRLRDEVRECIENHGGFNYDAIGAMRLATQTYLETLRLHPPTPLTTRLCTSPCTLPGTGLNMKVRDAVLVPIHQIHKDPRHFPDPEKFDPERFGGAMNVNGFIAFGDGPRSCPGGRFAQMMVVAGLATILQNFSVEPCSKTTPTIQYETRSVTDIEVLQAINQGDFIEDCLNIFVEFSNNKHYEHNIEPNIPLSVLDRGSELEVRAQKSEFNLSPTLAYTRPPPTANLVAAVADVINNITLYKRAPVQLWNEKGLYRVLFRCISQPNVSQRSFAHTAVCRALAASSTHKCVRVALANTKDCVYHLLLTLTPIESDPSWVLIASCLSSVLCSSVRARSFVVHRQLFRDISGVLHTMRDHLTLMGKPIDVIRNANHEPTLNTLNWVLILASSMMVDNPPAKDRLSEDIAASLTRLWPWCMMTEELRNSVMQFLLIFTNDCPKDNSSEEFTAAQIMTIMSMSQSHITEEVLDFEQYENIAPGRVSYLDRTDSSFGLELLHFYIFLQLIMDSRNLNKDGETCIHGTCLDMCPPQEMNLRKREKLVHKLEVTTEGYKLVKCYSRSAADSNMAVPSQLRPFPTLMTTTQYLLLNVSKRKDVKMSVIYNFLDDRLRSVRQDMTIQSDVLSVWCKLRGRCPSSLSCWWSLCANVCRHPDGAAAVLGSIQAKPAPAALLPALANAAHHCRHAFLQSSDLLELLSSCLLTGDTAEIVSSARAVWALAANNHRAKLVLRSAGLQTAVQTTLQRLQKNKDVATQRAVELLTYTNTVLQAI-
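
The protein record above appears to be the conceptual translation of the DPOGS213800-TA nucleotide sequence, with a trailing "-" standing for the stop codon. The following minimal sketch shows how that relationship could be checked. It assumes the standard genetic code and a frame starting at the first base; the `translate` helper and its `stop_symbol` parameter are illustrative and not part of any MonarchBase tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third position varying fastest); "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0 using the standard code.

    Stop codons are written as stop_symbol to match the record's convention;
    codons containing characters outside T/C/A/G are written as "X".
    """
    cds = "".join(cds.split()).upper()  # drop line breaks inside the sequence
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons and last four codons of DPOGS213800-TA.
print(translate("ATGCTACAATTCTTCTACGACAAA"))
print(translate("CAAGCTATATGA"))
```

Passing the full nucleotide record (with its line break removed, which the helper does) should yield a string directly comparable to the DPOGS213800-PA entry.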