Monarch geneset OGS2.0

DPOGS203449
Transcript: DPOGS203449-TA; 1725 bp
Protein: DPOGS203449-PA; 574 aa
Genomic position: DPSCF300242 + 72775-74669
RNAseq coverage: 59x (Rank: top 68%)
Annotation
Heliconius: HMEL009511; 0.0; 70.96%
Bombyx: BGIBMGA011114-TA; 6e-175; 61.65%
Drosophila: fj-PA; 2e-105; 40.47%
EBI UniRef50: UniRef50_B0W4F5; 3e-110; 47.98%; Four-jointed n=4 Tax=Diptera RepID=B0W4F5_CULQU
NCBI RefSeq: XP_001237917.2; 5e-112; 42.77%; AGAP008093-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|158297068; 9e-111; 42.77%; AGAP008093-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|157127177; 2e-106; 43.82%; four-jointed protein, putative [Aedes aegypti]
Group
KEGG pathway 
Orthology group: MCL16873 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203449-TA
ATGGCGTCGAGCACTGATAAAACAAATCGAAAATGTGAAATCGAGAGATTACCGAGCTTAAAAGATATTAATAATACAGAGAAAGGTACTGATCATAAAATAGGTTTGAATTTTAACTGGTACAAGATGAGGAGACCCCATTTGTTATACAATAACTGTGGGTACAGAGAGGAACAGTTCAGGAGGTTGAAGAAGGAGTACGGCTTCTATAGTTACTGTATGTTGAGTGTCGGACTTAGCTTCGTCCTGGGGCTGGTGATTGGAGCAGCTATAGTCGGCTCCCCGGCGACGTCTCCTAATTTCATTCAGAGAAAATTAGAAAGAATCAACTCGTCTAAAGTAAGAGTTCCTCACGAGAGATTGAACTCTTTGAAACTAACAGGTAACTCTGATGAGCAAATAAAGAAAGAAAGGGATGGTGATAGATTCTCGGCTGTATCCTTCGTGGGATCAGACGATCCCCGTAAATATAAAGTTGATGTGTATCCAAAAGGTTTAACTGATGATATGAGAGAGATCCTGAAAGACACGAAGGCCAGCGACCACAAACATTCCCTACCAGAAGTTGTCAACGGAATGGTTCATGTTCCTAACGATAATACTGTCCTGTATAATAATATTTATTGGGGTCCAGAGGTTGAGAACTCCATGCCTCAAGGATACGGAGAGAATTCAGCTGAGATTTGGGAAAAATACGTGGATCAGAGCGAGGTTATCAAAATGGAAGCTGGTTGTGGCAGGATGCAGAACAGGTTAATAACATTCCAGGACGGAATTCAAGCTTGCGTTCGTTACAGACAGAACACGGATCAGATCCAGGGGGAGATCTTCAGTTTCTACGTCGCCAGGCTCTTGAATCTGACCAATCTGGCGCCCTCCGTGGTCAAAGTTGTGGACTTGAAGGATAAACTGTGGCAGAACGTTGCCAACGACATCGCAACCGCCCAGTGGAACACCAACCGGGCCGTGGTGATCACACAGTACATACCAAGCTTAGACTCAGCGACCATACCTGAAATATTTAAACCCTCGACGCGACATCTTAATAAAATTGACATTTACAAGATGTCCGTCACAGAAAAGAATGATACAAAACAGTTGCTTTTAGATAAAATAAGAGCGAAAAACATTAAAACTAAAATAGAAGTCGCTGATGACTTCGACTACGTAGACGTTAAAGTGAACAAAAAGACGATCGAGCTGTTCGTTGAGTTGGCCCAGTGGTCTGATCTGATAGTCTTTGACTATCTGACAGCTAATCTGGATAGAATAGTTAACAATCTATTCAATTACCAGTGGAATATCAACATAATGGATGGACCAGCTCACAACCTGGCGAGGAAGATGGACAGCGGACTTCTTCTGTTCCTTGACAATGAATCCGGCCTTCTCCACGGGTATAGACTGCTGAAGAAGTACAACACCTATCACAGTCTCATGCTGGATAACCTGTGCGTGTTCAGGAAGAGCACCGTAGACGCTTTGAAGGTCATGTACAGATTACCCATAGGCAAGAAGCTGAGCGAGGTGTTCCACCAGAAAAACAGTGCTGTGATCAGAGACATACTGCCGCCGTTACCGGAGAAGAACGCTAAGATACTTCACGAGCGGTTAGGGAAGGTCCTAGCTCAGGGCGCGATGAAAGCATTTTTGATACCAAATAATATAACTTATGAAGTGCCAGTATGTAAATCACTATTATTTATGTCTGTGTCGGAACTGTGA

Protein sequence:

>DPOGS203449-PA
MASSTDKTNRKCEIERLPSLKDINNTEKGTDHKIGLNFNWYKMRRPHLLYNNCGYREEQFRRLKKEYGFYSYCMLSVGLSFVLGLVIGAAIVGSPATSPNFIQRKLERINSSKVRVPHERLNSLKLTGNSDEQIKKERDGDRFSAVSFVGSDDPRKYKVDVYPKGLTDDMREILKDTKASDHKHSLPEVVNGMVHVPNDNTVLYNNIYWGPEVENSMPQGYGENSAEIWEKYVDQSEVIKMEAGCGRMQNRLITFQDGIQACVRYRQNTDQIQGEIFSFYVARLLNLTNLAPSVVKVVDLKDKLWQNVANDIATAQWNTNRAVVITQYIPSLDSATIPEIFKPSTRHLNKIDIYKMSVTEKNDTKQLLLDKIRAKNIKTKIEVADDFDYVDVKVNKKTIELFVELAQWSDLIVFDYLTANLDRIVNNLFNYQWNINIMDGPAHNLARKMDSGLLLFLDNESGLLHGYRLLKKYNTYHSLMLDNLCVFRKSTVDALKVMYRLPIGKKLSEVFHQKNSAVIRDILPPLPEKNAKILHERLGKVLAQGAMKAFLIPNNITYEVPVCKSLLFMSVSEL-
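
The lengths and coordinates listed in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration and is not part of the database: the function name `coding_summary` is hypothetical, and it assumes the genomic coordinates are 1-based and inclusive and that the transcript is coding sequence ending in a single stop codon (the nucleotide sequence above ends in TGA, and the protein sequence ends in "-").

```python
def coding_summary(transcript_bp, protein_aa, start, end):
    """Relate transcript length, protein length, and genomic span.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    (both are assumptions, not stated in the record).
    """
    codons, remainder = divmod(transcript_bp, 3)
    genomic_span = end - start + 1
    return {
        "codons": codons,
        "in_frame": remainder == 0,
        # One extra codon is expected for the stop.
        "matches_protein": codons == protein_aa + 1,
        "genomic_span_bp": genomic_span,
        # Bases inside the genomic span that are not in the transcript.
        "non_transcript_bp": genomic_span - transcript_bp,
    }


# Values taken from the record above.
summary = coding_summary(1725, 574, 72775, 74669)
print(summary)
```

Under these assumptions, 1725 bp is 575 codons (574 residues plus the stop), and the genomic span of 1895 bp exceeds the transcript by 170 bp, which would be consistent with intronic sequence.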