Monarch geneset OGS2.0

DPOGS201458
Transcript | DPOGS201458-TA | 2874 bp
Protein | DPOGS201458-PA | 957 aa
Genomic position | DPSCF300006 - 382349-391737
RNAseq coverage | 531x (Rank: top 24%)
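
The transcript and protein lengths listed above can be cross-checked with simple arithmetic: 2874 bp is 958 codons, which is 957 residues plus one stop codon. The Python sketch below makes that check explicit. It assumes the transcript record is a complete coding sequence with no untranslated regions, and that the genomic coordinates are inclusive. The function name `codon_count` is illustrative and is not part of any database API.

```python
def codon_count(cds_length_bp: int) -> int:
    """Return the number of codons in a CDS, rejecting lengths not divisible by 3."""
    if cds_length_bp % 3 != 0:
        raise ValueError(f"{cds_length_bp} bp is not a whole number of codons")
    return cds_length_bp // 3


# Values copied from the record.
transcript_bp = 2874
protein_aa = 957

codons = codon_count(transcript_bp)
# 958 codons = 957 amino acids + 1 stop codon (assumes a CDS-only transcript).
lengths_agree = codons == protein_aa + 1

# Genomic interval from the record, treated as inclusive (an assumption).
genomic_start, genomic_end = 382349, 391737
genomic_span = genomic_end - genomic_start + 1

print(codons, lengths_agree, genomic_span)
```

Under these assumptions the genomic interval is longer than the transcript, as expected if the gene model contains introns.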
Annotation
Heliconius | HMEL015960 | 0.0 | 89.41%
Bombyx | BGIBMGA002611-TA | 0.0 | 82.24%
Drosophila | Utx-PA | 0.0 | 54.69%
EBI UniRef50 | UniRef50_B4JP26 | 0.0 | 51.61% | GH13016 n=1 Tax=Drosophila grimshawi RepID=B4JP26_DROGR
NCBI RefSeq | XP_001944528.1 | 0.0 | 53.24% | PREDICTED: similar to uty-prov protein [Acyrthosiphon pisum]
NCBI nr blastp | gi|195387429 | 0.0 | 53.76% | GJ21917 [Drosophila virilis]
NCBI nr blastx | gi|158296543 | 0.0 | 56.49% | AGAP008509-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology | GO:0005515 | 2.6e-45 | protein binding
Gene Ontology | GO:0005488 | 2.7e-31 | binding
KEGG pathway
InterPro domain | [646-809] | IPR003347 | 2.6e-45 | Transcription factor jumonji/aspartyl beta-hydroxylase
InterPro domain | [684-792] | IPR013129 | 6.1e-32 | Transcription factor jumonji
InterPro domain | [5-223] | IPR011990 | 2.7e-31 | Tetratricopeptide-like helical
InterPro domain | [165-196] | IPR001440 | 6.7e-06 | Tetratricopeptide TPR-1
Orthology group | MCL10441 | Multiple-copy universal gene
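
The InterPro rows above give residue ranges that appear to nest: the jumonji entry IPR013129 falls inside IPR003347, and the TPR-1 repeat IPR001440 falls inside the tetratricopeptide-like region IPR011990. The Python sketch below checks this. It assumes the bracketed ranges are 1-based, inclusive positions on the 957 aa protein. The helper names are illustrative only.

```python
# Domain coordinates copied from the InterPro rows of the record.
# Assumption: positions are 1-based and inclusive.
DOMAINS = {
    "IPR003347": (646, 809),  # Transcription factor jumonji/aspartyl beta-hydroxylase
    "IPR013129": (684, 792),  # Transcription factor jumonji
    "IPR011990": (5, 223),    # Tetratricopeptide-like helical
    "IPR001440": (165, 196),  # Tetratricopeptide TPR-1
}
PROTEIN_LENGTH = 957


def contains(outer, inner):
    """True if the inner interval lies entirely within the outer interval."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


def within_protein(span, length):
    """True if the interval is well-formed and fits on a protein of this length."""
    return 1 <= span[0] <= span[1] <= length


jmj_nested = contains(DOMAINS["IPR003347"], DOMAINS["IPR013129"])
tpr_nested = contains(DOMAINS["IPR011990"], DOMAINS["IPR001440"])
all_in_range = all(within_protein(s, PROTEIN_LENGTH) for s in DOMAINS.values())

print(jmj_nested, tpr_nested, all_in_range)
```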
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201458-TA
ATGGCGTTGTACGTGTCCCCTGGATTCACCCGGGCCGCGGACGCTCACCTCAGGCTGGCGCTCATGTTCAAGGCCCGCCGGCACTGGGCCGCCGCTGCCGTCCACTTCCGAAGGGCGAAGCTCGCGCCGCATCAGGACGCGACCTTCACACGCCTCGAGCTCAGCTTCCACGCTGCGCATCTCCTAGAAGCGAGAGGACTCAGGAAGGCAGCCAGAGATGCCTACGAGCGATTACTGAAGGAACCGCAGCTGTCATCGTCCTTGAAAGCTGATGTCTGTCGCCAGTTAGGATGGTTGTATCATCGTTGCGTGTCTCTGGGCGAGACCGCAGCCCGAGCTCGCGCTGCGGTGTGGTGTCTCCAGCGTGCTGTGATGGCGGAGCCCGAGTCCGGTGCGGGGCTGTACCTGTTGGGGAGGTGCTTCGCCGCTCAGGGGAAAGTGCACGACGCGTTCGTCGCATATAGAAATTCAGTCGAGAAATCCGAAGGGAACGCTGATACATGGTGCTCTATAGGAGTACTCTACCAGCAACAGAACCAGCCAATGGACGCGCTCCAAGCTTACATCTGCGCTGTTCAGTTGGATAAAGGTCACTCAGCGGCCTGGACGAACTTAGGCAGTCTATACGAGAGCTGTTCTATGGCGAGGGATGCGTTCGCCTGTTATAACAACGGTGGACCAGCTGCCACTATGGGAAACGCCCCACTGAGACAGAGACTCGCCTTCCTCAGAGCACACCTGGCACATGCACCAATGCCCTCAGTTACTGGCAAACGTCGTCCCTTGCCTTCGATTGAGGAGGCCTGGAATCTTCCTATATCAGCAGAGATGTCGTCGCGGGCTCCTAAGGCCGCGCCGCCTCCATATCCTGGAGCAGCTGGAGGCAGTTCGGGGGGAACAGCCGCGAGCGCTGGAGGAACGACGGGGAGTGCCGCTGGAGCCGCCAAGGAACCCGCCTCGCTCAGCCACCACCAGCTGCAAACGCTGCAGTACTTGCAGAGAAATTCTCATAACCTGTCACCTCAACAACAAGCGCTGATGCAGCAGCTATTGTCTCAATACCGTTTGTCACAAGCGGCGAGAGCTCGCTCGGTTGCGAAGAGTGAGGGCAGTGCGGCCGGGGACAGCGCCGAGTCGCTGGCAGAGGATCTGTTGAAGAAGTTCACAGACTCACAGCCTGATATCAAGAAGGAACCCGCAGTTAGGAGTCCATTAAACTCTAGCAACGAAGCTATGCTCGGCGGCCGGCAGCCCGTGGTTAAACTGGAGCCGCTGAAGACGGACCCGCTCAAACCCGTCTCCTTCAACATCGGCATGAGCGCCAAGAACATACTGGACGCCTGCAAAGAGAACGCCGGTCCTCCGTCTTCATGGTCCGTCCTGGGTCCAGGCGCTCCGCCCCCCGACCCTCCCGGCGTCCCACCCCCGCGCCTCACCGCGGACCAGCTCGCGCCCCCCGCGCCCTTCGTCTATGTGGAGACCAAACGCGACGCCTTCTCCCCGCAGCTTCAAGATTTCTGCCTCAAACACCCGATCGCGGTGGTGCGTGGTCTCACCGCCGCTTTGAAACTAGATCTCGGCTTGTTCTCTACCAAGACGCTGGTAGAGGCGTGGCCGGACCACGCGGTCGAGGTCCGCACGCAGCTGATGCAGTCCGCAGACGAGAACTGGGACGCGAGCGGTCGGCGCCGCGTGTGGGCGTGTGCCTCGCACCGCTCACATACTACCGTGAGGAAATACGCGCAGTATCAGGCGGGGTCGTTCCAGGAGTCGCTGAGGGAAGAGCGTGAGCGTGGCGCAGCGCCAGCGCACGCACACTCGGCCGGCGCGCTCTCCGACTCGGACGGACGTGAGTCGGGCTCCGGTCCCGCTAAGCGACGTCGAGCCGCTCGTATGCTGCGCTTCGGAACCAACGTAGATCTATCAGACGAACGCAAGTGGCGCGCTCAGCTCACTGAGCTGCAGAAGTTACCGGCCTTCGCTCGCGTGGCATCCGCCGC
TAACATGCTGTCTCACGTCGGACACGTCATCCTGGGCATGAACACGGTGCAGCTGTACATGAAGGTCCCCGGCAGCCGTACGCCCGGACATCAGGAGAATAACAACTTCTGCTCCATCAACATCAACATAGGTCCCGGCGACTGCGAGTGGTTCGGTGTGCCGGACTCGTACTGGGGCGGCGTCCGCGAGCTGTGCGACCGGCACGGACTGTCCTATCTTCACGGCTCGTGGTGGCCGGACCCCGAGGAGCTCCGCGCTCACGGCGTGCCCGTGTACCGATTTACACAGCGTCCCGGCGACCTCGTGTGGGTGAACGCGGGCTGCGTTCACTGGGTGCAGGCCACCGGCTGGTGTAACAACATCGCCTGGAACGTAGGGCCCCTCACCGCCAGACAATACTCGCTCGCTCTCGAGCGCTACGAGTGGAACAAAGTCCAAAACTTCAAGTCGATAGTCCCCATGGTGCACCTCACGTGGAACCTGGCCCGGAACATCCGCGTGTCGGACCCGCGGCTCCACCGTGCCATGCGCACGTGTCTGCTGCAGACTCTCCGCGCGGCGGCCGGCACGCTGCAGACGGTGCGCGCGCGCGGAGTGCCGGTGAGGTTCCACGGCCGAGCCCGCGGCGAGGCGTCACACTACTGCGGCGCCTGTGAGCGGGAGGTGTGGCACGCGCTGCTGGTCCGCGAGCACGAGCGCCGCCACGTGGTCCACTGCCTGGCGTGTGCTCGCCGCGCCAGCGCTACGCTGCAGGGCTTCCTGTGTCTCGAGGAGCATCACATGGAGGAGCTGGCGCAGGTGTACGACGCCTTCACCTTACACCGGCCCACGCCCGCCGCGCCCGCGCCGCCGCCCGCACTCACGCCCGACTGA

Protein sequence:

>DPOGS201458-PA
MALYVSPGFTRAADAHLRLALMFKARRHWAAAAVHFRRAKLAPHQDATFTRLELSFHAAHLLEARGLRKAARDAYERLLKEPQLSSSLKADVCRQLGWLYHRCVSLGETAARARAAVWCLQRAVMAEPESGAGLYLLGRCFAAQGKVHDAFVAYRNSVEKSEGNADTWCSIGVLYQQQNQPMDALQAYICAVQLDKGHSAAWTNLGSLYESCSMARDAFACYNNGGPAATMGNAPLRQRLAFLRAHLAHAPMPSVTGKRRPLPSIEEAWNLPISAEMSSRAPKAAPPPYPGAAGGSSGGTAASAGGTTGSAAGAAKEPASLSHHQLQTLQYLQRNSHNLSPQQQALMQQLLSQYRLSQAARARSVAKSEGSAAGDSAESLAEDLLKKFTDSQPDIKKEPAVRSPLNSSNEAMLGGRQPVVKLEPLKTDPLKPVSFNIGMSAKNILDACKENAGPPSSWSVLGPGAPPPDPPGVPPPRLTADQLAPPAPFVYVETKRDAFSPQLQDFCLKHPIAVVRGLTAALKLDLGLFSTKTLVEAWPDHAVEVRTQLMQSADENWDASGRRRVWACASHRSHTTVRKYAQYQAGSFQESLREERERGAAPAHAHSAGALSDSDGRESGSGPAKRRRAARMLRFGTNVDLSDERKWRAQLTELQKLPAFARVASAANMLSHVGHVILGMNTVQLYMKVPGSRTPGHQENNNFCSININIGPGDCEWFGVPDSYWGGVRELCDRHGLSYLHGSWWPDPEELRAHGVPVYRFTQRPGDLVWVNAGCVHWVQATGWCNNIAWNVGPLTARQYSLALERYEWNKVQNFKSIVPMVHLTWNLARNIRVSDPRLHRAMRTCLLQTLRAAAGTLQTVRARGVPVRFHGRARGEASHYCGACEREVWHALLVREHERRHVVHCLACARRASATLQGFLCLEEHHMEELAQVYDAFTLHRPTPAAPAPPPALTPD-
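
The protein record above corresponds to a translation of the nucleotide record, with the trailing `-` standing for the stop codon. The Python sketch below shows one way to reproduce that translation. It assumes the standard genetic code (NCBI translation table 1) and reading frame 1. The function `translate` and its `stop_symbol` parameter are illustrative and are not taken from the database's own pipeline.

```python
from itertools import product

# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1; stop codons become stop_symbol."""
    # Drop whitespace and line breaks so multi-line FASTA bodies work.
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six and last six codons of DPOGS201458-TA, copied from the record.
start = translate("ATGGCGTTGTACGTGTCC")
end = translate("GCACTCACGCCCGACTGA")
print(start, end)
```

Both fragments agree with the corresponding ends of the DPOGS201458-PA record, which begins `MALYVS` and ends `ALTPD-`.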