Monarch geneset OGS2.0

DPOGS213337
TranscriptDPOGS213337-TA2007 bp
ProteinDPOGS213337-PA668 aa
Genomic positionDPSCF300109 - 535356-538508
RNAseq coverage222x (Rank: top 45%)
Annotation
HeliconiusHMEL0144990.072.28% 
BombyxBGIBMGA009144-TA0.073.85% 
DrosophilaCG5543-PA0.054.85% 
EBI UniRef50UniRef50_Q9W1J30.054.85%Gastrulation defective protein 1 homolog n=27 Tax=Neoptera RepID=GAD1_DROME
NCBI RefSeqXP_393688.20.059.70%PREDICTED: similar to CG5543-PA [Apis mellifera]
NCBI nr blastpgi|3800272160.059.62%PREDICTED: gastrulation defective protein 1 homolog [Apis florea]
NCBI nr blastxgi|3071863780.060.24%Gastrulation defective protein 1-like protein [Camponotus floridanus]
Group
Gene OntologyGO:00055155.3e-49protein binding
KEGG pathway 
InterPro domain[175-516] IPR0110465.3e-49WD40 repeat-like-containing domain
[165-515] IPR0159433.6e-47WD40/YVTN repeat-like-containing domain
Orthology groupMCL13862 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213337-TA
ATGAGTAAAAAGCCCATTTCATTCGGAAAAATTAGTTTTGCCATCAACACATTGGGACAGAATAATGTCGAGAAGGATAGCAATTCAGGCTTTGGCACGTTCGGTCGCACGCCGATTATGGAACAAAAGGATATCGAGGATATTTCCGAAGACCTAGAGAGCCAAAACTTGCATCAAGTAATGGGTATTAAAAATTTTGGTAAAAAAGCAAAATGCTATAATATTGATGAAATGGTAGAACAGGCGCGTTTAACCGCTCAAGAAAACAAAAAAAAAGTAGAAGCCGAAAATGCAAATAAAGAAAAGGATATGCAAGGTACTAAACTTTTAGGGCCAGTTAATGTGTCTACAAGTAATGATGATCAAGATGAAGAGGATGATGAACCTATCGGTCCTCCCATACCAGCGAATCTGTTACCAACTTTACACACCAAAGAACACAAAAACAATGAAACAAAAAACTCTGATAACAGTGATGAGAAGAATCAACAAAGTGAAGACAGCTATGACGATCTCAGCGGCTCGGATGACGAAGAGTTATCAATAGAGAAAAGGATACCAAATACACATGAGGTAGAAATGCAACATGGAAGCAAGGCAGTAGTAGCCGTAGCGGTGGACCCCTCAGGAGCTCGACTGGCTACTGGCTCAGTTGATTACGAAGTTTCATTTTGGGATTTTGCTGGTATGGACACGTCGATGCGTTCGTTCCGCACGCTCCAGCCGTGTGAGAACCATCCCATCAAGGCCCTGCAGTACTCCGCCACCGGGGACTCCATACTAGTGGTGAGCGGCTCCGCTCAGGCCAAGGTGCTGGACCGGGACGGGTTCGAGGTCCTGGAGTGTGTGAAGGGCGACCAGTACATCACAGACATGGCCAGGACCAAAGGCCACACGGCGTCGTTGAACAGCGGCTGCTGGCATCCGACCGTCCGCGAGGAGTTCATGACCTGCTCCCAGGACGGCACCCTGAGGTTGTGGCTCACGGACAACAGCAAACAACACAAGGACGTCATCAAGCCGCGGCAGAGAGGAGGCCTCAAGACCAACCCCACGACGTGCGCCTTCACGAGGGACGGGAACACCGTGGCCTGCGGCTGTTACGACGGCTCCATACAGATGTGGGACCACCGGAGGAACTTCGTTAACACCTCCTTGATCCTGAGAGACGCTCACTCCGCGCACACCGAGATCTCCTGCATCGCCTTCTCCTACCTGGGCTCCTACCTGGCCAGCCGCGCCAACGACAACACCCTTAAGATATGGGATCTGAGGAATCACCGGAAGCCGCTGAACGTGTTCGGCGACCTGTTCTCGAGGTACGACCAGACCGACTGCGGCTTCAGTCCGGACGACTCCATGGTGTTCACCGGAGTGTCGGTGCAGAACAACCGGGAGCCCGGTAGATTGATCTTCTTTAATTCCAAAACATTCGAGAGAGTCACCGAAGTCCCAGTGACCGCGTCTCACGTCATCAAGGCCGTGTGGCACGCCAAACTCAATCAAATATTCGTGGGTTGCGGTAACGGGATCGTGAAGTGTTACTACGACACCAAGCGGAGTCTGCGAGGAGCCAAACTCTGTATCGTGAAGACGCACAGGAAGAAGCAGATGGTGGAGGTGGTGAGCTCTCAACAGATCATAACGCCTCATGCGTTGCCGCTGTTCAGGCAAGAGAAACTCAGGACGAGTAAGAAGAAAATGGAGAAAGAGAGAATGGATCCGGTCAAATCCAGGAGACCGGACTTGCCGATCACTTCCGGCCAAGGAGGACGAGTGGCCGCCTCGGGCAGCACGCTGAGCTCGTTCGTGATCAGAAACCTAGGGCTGAGCAAGAGAGTGGACGACGAGCAGGACCCGAGGGAAGCCATCCTGAAATACGCCAAGGAGGCCGAGGAGAACCCCTTCTGGGTCGCGCCAGCCTACAAGAAAACACAACCCACGCCCATCTTCCAAGACGACGAGGACGGACCTAGCGACGCCAAGAAGACCAAATCTGATACTTGA

Protein sequence:

>DPOGS213337-PA
MSKKPISFGKISFAINTLGQNNVEKDSNSGFGTFGRTPIMEQKDIEDISEDLESQNLHQVMGIKNFGKKAKCYNIDEMVEQARLTAQENKKKVEAENANKEKDMQGTKLLGPVNVSTSNDDQDEEDDEPIGPPIPANLLPTLHTKEHKNNETKNSDNSDEKNQQSEDSYDDLSGSDDEELSIEKRIPNTHEVEMQHGSKAVVAVAVDPSGARLATGSVDYEVSFWDFAGMDTSMRSFRTLQPCENHPIKALQYSATGDSILVVSGSAQAKVLDRDGFEVLECVKGDQYITDMARTKGHTASLNSGCWHPTVREEFMTCSQDGTLRLWLTDNSKQHKDVIKPRQRGGLKTNPTTCAFTRDGNTVACGCYDGSIQMWDHRRNFVNTSLILRDAHSAHTEISCIAFSYLGSYLASRANDNTLKIWDLRNHRKPLNVFGDLFSRYDQTDCGFSPDDSMVFTGVSVQNNREPGRLIFFNSKTFERVTEVPVTASHVIKAVWHAKLNQIFVGCGNGIVKCYYDTKRSLRGAKLCIVKTHRKKQMVEVVSSQQIITPHALPLFRQEKLRTSKKKMEKERMDPVKSRRPDLPITSGQGGRVAASGSTLSSFVIRNLGLSKRVDDEQDPREAILKYAKEAEENPFWVAPAYKKTQPTPIFQDDEDGPSDAKKTKSDT-