Monarch geneset OGS2.0

DPOGS204118
TranscriptDPOGS204118-TA1152 bp
ProteinDPOGS204118-PA383 aa
Genomic positionDPSCF300184 - 66715-75515
RNAseq coverage64x (Rank: top 67%)
Annotation
HeliconiusHMEL0129014e-9966.80% 
BombyxBGIBMGA013607-TA1e-5467.41% 
DrosophilaCG30502-PB8e-3434.57% 
EBI UniRef50UniRef50_D6WPH57e-3939.06%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WPH5_TRICA
NCBI RefSeqXP_973946.11e-3939.06%PREDICTED: similar to fatty acid hydroxylase [Tribolium castaneum]
NCBI nr blastpgi|3323743463e-4038.26%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|2700103645e-4339.06%hypothetical protein TcasGA2_TC009754 [Tribolium castaneum]
Group
Gene OntologyGO:00200373.3e-11heme binding
KEGG pathwayafm:AFUA_1G158209e-20 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[8-61] IPR0011993.3e-11Cytochrome b5
Orthology groupMCL15674 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204118-TA
ATGGCATCTAGCTCTTTCCCCGTCAGTTACACGGGAAATGTGTATGACATAAAGAGTTTCCTGAGGGATCATCCCGGGGGAGTTCGGACTTTGGAAAAGTATAAGGGCAAGTCCATCTCGCAGGTGATGAGAGTGTACGGGCACAGTTACAGCGCGTATCACATGTTGAGAGACTTTAAAGTTAGTGATGGGAACTTGACGGGGGCTGTTAGTGAGCGTGGTAGGATTATTACCAAGGAGGAAGAGGAATTAGATGATAAGGAGATAGCCTTCCTCGAGGAGTTGGAGAGTAGACTGGATTGGTCGAAGCCTCTTCTGTCACAGCTGAGCAAAATAGCGCCACACTATCAGAAATGGGTGAACAGCGCCGTTTATAGGAAGTGCCGTCTCTTCGAGAGCCCGCTCCTGGAGGGCTTGACGTACACCCCCTGGTATCTCGTGCCAATGTTCTGGATACCCGTCATTATTTACCTGACGGTGACGCAATGTTTCAGTCACGTATATTGTGGAGATGTCTGCACGAGCCCCATCACAGAGGTGGAATTCGTTTTCCACATGGCGATCGGCTTTATGGTGTGGACGTTCCTGGAGTACTCGCTACACAGATGGGTGTTCCACTACGACCCTGGCTCCTCAATAAAACTGATACAGCTGCATTTCCTGATACACGGAATGCATCACAAGCTCAAATACGATAGCACATCTGACTTAAAATCTCTGGAGGACACTCAAAATAAACTTTTGACGGAGTGTAGTGTTTTGGACCTTAAGCTCAAGGACCTGGAAACTAAAACAAATGAGGCCAAGGAAGAACTATTGGACTTACAGGCGGAAGTTAATAATCAGCATCACTGGGCTAGAATCACTAATATTGAAGTAACAGCTTTACCAGCGCTTCATAGAGAATCACCTATAGATTTAGTTATCGGGATTGCTAACTTTGCTGGTGTGATCCTTACACGCGATCAAGTCGAGTTTGCTACGAGGGTCCAACCACAAAAACCAAACCCTAACAGACCAAAATCCCTCATTGCAAAACCGAAGAACCGGGATGTTAAAGACTCAATTTTATCTGAACTTCGTAAACTTAAAGGCATTAATACAAAAAATAGAGATGTTGAAGGAGCCTCAAAAAAAGGTTTCTGTGAATGA

Protein sequence:

>DPOGS204118-PA
MASSSFPVSYTGNVYDIKSFLRDHPGGVRTLEKYKGKSISQVMRVYGHSYSAYHMLRDFKVSDGNLTGAVSERGRIITKEEEELDDKEIAFLEELESRLDWSKPLLSQLSKIAPHYQKWVNSAVYRKCRLFESPLLEGLTYTPWYLVPMFWIPVIIYLTVTQCFSHVYCGDVCTSPITEVEFVFHMAIGFMVWTFLEYSLHRWVFHYDPGSSIKLIQLHFLIHGMHHKLKYDSTSDLKSLEDTQNKLLTECSVLDLKLKDLETKTNEAKEELLDLQAEVNNQHHWARITNIEVTALPALHRESPIDLVIGIANFAGVILTRDQVEFATRVQPQKPNPNRPKSLIAKPKNRDVKDSILSELRKLKGINTKNRDVEGASKKGFCE-