Monarch geneset OGS2.0

DPOGS210773
TranscriptDPOGS210773-TA1533 bp
ProteinDPOGS210773-PA510 aa
Genomic positionDPSCF300312 + 142344-148491
RNAseq coverage130x (Rank: top 56%)
Annotation
HeliconiusHMEL0138883e-16161.02% 
BombyxBGIBMGA011116-TA0.073.59% 
DrosophilaCG5065-PC2e-11341.41% 
EBI UniRef50UniRef50_D2A3T29e-13345.49%Putative uncharacterized protein GLEAN_15753 n=4 Tax=Tribolium castaneum RepID=D2A3T2_TRICA
NCBI RefSeqXP_001663993.12e-13745.72%hypothetical protein AaeL_AAEL013802 [Aedes aegypti]
NCBI nr blastpgi|1571374453e-13645.72%hypothetical protein AaeL_AAEL013802 [Aedes aegypti]
NCBI nr blastxgi|1571374456e-13146.14%hypothetical protein AaeL_AAEL013802 [Aedes aegypti]
Group
Gene OntologyGO:00054881.9e-28binding
GO:00166201.1e-17oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
GO:00551141.1e-17oxidation-reduction process
KEGG pathwaynvi:1001176832e-117 
 K13356 (FAR)maps-> Peroxisome
InterPro domain[39-306] IPR0131203e-69Male sterility, NAD-binding
[294-336] IPR0160401.9e-28NAD(P)-binding domain
[378-471] IPR0042621.1e-17Male sterility
Orthology groupMCL17409 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210773-TA
ATGGAGTGGCTGAACGGGAATGACCCAAAAATACCAAACTTCAATGACCTTAATGAGCCAAGAAACCTCTGCGAAGAGAGAATCGCGGACTATTTCCAGGATTCTATCATTCTGGTGACGGGGGGCACCGGCTTCGTGGGGAAAGCACTCCTGGAGAAGCTTTTGCGGAGTTGTCCTGGGATTGATACCATTTATATATTGATGAGACCCAAACGTGGTCTCTCGGTGGAACAGAGGTATAAGGAACTTTTGAAAAATCAGGTATTTGACCACATACGAACAAGATGGCCGGATCGTCTATCCAAGCTCTATCCAATAACTGGTGATGTATCAGCCCCAAATCTTGGCGTCAGCGCTGAACAGCAGCAATTGTTGAATAAATGTCATACCGTATTCCATTCTGCTGCAACTGTGAGATTTACTGATCCATTGCACGCTGCAGCCGCCCTTAATGTGCAAGGAACGGCTAGCTTGCTGCAATTGGCTGAGGACATGCCCTTTCTTAAGGCCCTGGTTCACGTGTCTACCGCATACAGCAATGCCCCTCGTAGCCACATCGAAGAAAGGGTCTACGCCCCACCATACGACCCTGAGAGCATTGTCAGATGCACCAAGATGTTACCGGCTGAAACCGTCGAAGTTATTGCCGAATCCTTACAGGGTGAACATCCAAACCCATACACCCTCACGAAGGCTCTGGCGGAGTCCATCGTATACAGTCACACAAATCTCCCCGTCTGCATAGTACGTCCTTCCATTGTGACAGCTGCCTACCAAGAACCTTTTCCGGGATGGATTGACAACATATATGGAGTGACAGGTATTATAATGGAAATTTCTCGAGGGACCTACCGTTCTGGTTATTGTCGTGAACGGTACGTGGTCGACCTGGTGCCAGTGGACATGGTGGTCAATAGCTGCATACTCGCAGCTTGGAGGCAAGGCAGCAAGAAACCTGGTCGTTGTCCCGTGTACAACGTGACGTCTGGGTCAATTAATCCGTTGCAATGGGGACATTTCACCAAACTTTGCGTCAAATGGGCGAGAGAGAATCCCACGAAGTACGTCATGTGGTATCCTAACTTTGCGTTCACGGAGTCTCGTGTCATGAATACGTTTTGGGAAATTTCTTGTCACTTCTTACCGGCTTTCTTGTATGATTTACTACTAAGGGCCCAAGGAAGGAAAGCTATAATGATGAAACTGGCTCGTCGTTTCAAAATGGCGGCTGCGACTGGCGAATATTTCGCCAATCACGAGTGGCAGTTCAGTGTGTCCGAATTGACAGCTCTCCACGATGAAGCTAGTGTTGCCAGTGACGCAGGCGCGTTCTCCCACTGGCCAGGGCACTTCAACTGGGAGTCGTATATCGGTTCATATATGTTGGGAATTCGGAGATTTATTCTCAAGGACAGCATTGATTCTCTGCCCCAAGCCAGGAATAAACTGAACAGATTATATTGGGTACACCGGCTGTTCCAAGTCGCAACCGGATATTATCTCTTCAAATTTTTAGCTGGCCGATTACGATAA

Protein sequence:

>DPOGS210773-PA
MEWLNGNDPKIPNFNDLNEPRNLCEERIADYFQDSIILVTGGTGFVGKALLEKLLRSCPGIDTIYILMRPKRGLSVEQRYKELLKNQVFDHIRTRWPDRLSKLYPITGDVSAPNLGVSAEQQQLLNKCHTVFHSAATVRFTDPLHAAAALNVQGTASLLQLAEDMPFLKALVHVSTAYSNAPRSHIEERVYAPPYDPESIVRCTKMLPAETVEVIAESLQGEHPNPYTLTKALAESIVYSHTNLPVCIVRPSIVTAAYQEPFPGWIDNIYGVTGIIMEISRGTYRSGYCRERYVVDLVPVDMVVNSCILAAWRQGSKKPGRCPVYNVTSGSINPLQWGHFTKLCVKWARENPTKYVMWYPNFAFTESRVMNTFWEISCHFLPAFLYDLLLRAQGRKAIMMKLARRFKMAAATGEYFANHEWQFSVSELTALHDEASVASDAGAFSHWPGHFNWESYIGSYMLGIRRFILKDSIDSLPQARNKLNRLYWVHRLFQVATGYYLFKFLAGRLR-