Monarch geneset OGS2.0

DPOGS210399
TranscriptDPOGS210399-TA1803 bp
ProteinDPOGS210399-PA600 aa
Genomic positionDPSCF300291 + 101646-136465
RNAseq coverage183x (Rank: top 49%)
Annotation
HeliconiusHMEL0210543e-7883.64% 
BombyxBGIBMGA008410-TA5e-9976.82% 
Drosophilasdt-PG8e-4250.00% 
EBI UniRef50UniRef50_Q7PQW51e-4456.89%AGAP002711-PA n=2 Tax=Anopheles gambiae RepID=Q7PQW5_ANOGA
NCBI RefSeqXP_974746.22e-5847.23%PREDICTED: similar to AGAP002711-PA [Tribolium castaneum]
NCBI nr blastpgi|1892361435e-5747.23%PREDICTED: similar to AGAP002711-PA [Tribolium castaneum]
NCBI nr blastxgi|1892361437e-7046.24%PREDICTED: similar to AGAP002711-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055158.9e-06protein binding
KEGG pathwayaga:AgaP_AGAP0027115e-42 
 K00942 (E2.7.4.8, gmk)maps-> Purine metabolism
InterPro domain[55-102] IPR0014788.9e-06PDZ/DHR/GLGF
Orthology groupMCL18336 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210399-TA
ATGAATTCGTTCAACGCGCCTAATCAAATCAACGCTATTGACAAAATGACCACCGTGAAATGCAAGTCTGTCGATTTAGAAGGATACGTCATAATAGTCGTGCAGACCAAGGATAATAAAATAAAACTATACGGTTCACCTTCCGCCGGCAACTGGGAGAATCTGGAGCTGGCTGACGAGATCATGGATGTGAACGAAACCAAGCTAGAAGACATGACCAGGACTGAAGTACTCACACACATTCATGAGTGTATTTCATCGTGTGTCATCAAACTGCGAGTAAAGAGACGGAGTGAAACAAAGCTTTTATCAGACATCGGTCATAACGTGATCCAGGATGCCTTCCTCATTGCTGTGGAGGAACAGGCGAGGCAGCGTCTCCAACGACTGTCAGCTCTCAAGAGAATCACTCCAGTTGATATGTCCAAGCTGTCTGCTGAATTAAACAAAAGAAAACGAACGCAAACTGAGAGCCGACAGGAGTTGAACGGTTACATAGCAAACTCCACAGTATACGTGACTTCTATAAACGAGAACGGAGCTATCGAGAGCAAACCCACAATATCATCACCGAAATCACAACCGAAGTCCCTCCAAGCGAAGCCAGCACCGCTCATAGCCAACGGCATCCAAGACAAGAAGGACGCCGTCGAGGACGCAGTCGAACCAGAGAAACACGAAAAAGATTCTAGTGAGAAAAGTTTTAAGGTTGATAGTGAATTGAAGGAAAGTGAAGTGTTGAGTGTTAAAAGTGATGCGTACAATGTGGCTAGAGAGCATCTGAATAACGGGAGGTGCGAGAAACTGCTGGGGGAGCAACCGGATCATCGGATACGAGCCACTGCTGTAGTCGAGAACAGTAAACTAGGACCGGGAGAGAAAGGGCCTCGCAGACGTTCTGGCTCTAGCATTGTGGTGCTAGGAGCTGAAGAGGAGAAGCTTCCGCCCCCGGATGACGAGACAGACATGCTCACCATGCTCTCACTCACCACGGATACTGGCCCCCACCGCGAGATGGCGGTGGATGTCCCAGACAGCTTCATCGCTAGGAATAAGACCCCGCCGCGCTACCCGCCACCCCGCCCACCACAGGTACGCCCTCGTGCCAGAGATGATGAGCCCCTCTTATCATCAGGCGGCTCAGGAACTTCTTTTGGCAGCAAACAGAGCACCGTCCTCCAAAGCAACGTCGACATATCCGGCCCAAGCTCCGACGGCTCAGGCAGTTTCAAGATTAACAGCACCATGGAATTACTGTCATTACCCCAGAGCTCGGACAGTTCTGGGGTAAGTTTTGGCAGCAAGAAAAGCAAGGGTTCTGCGGAAACAGCGAAATGCGATATAGCATCAGAAAGGTCCGAATCGGTGGATTCTATTCTGTCACACAAAATACAACTGCTGATATCGAACGGTGAAGATTCTCTCTTGGATTGTAAGGAGGGTGTGACGTACTCCACGCGGACTGATTCAGTCCTCATCAGGGAAGCTGTGTACGCTTCTTACAAGCTGCCTCCGGGGTTTGATACGACGCCCGTGCTCCTTGGACGGGAAGTGGCGGTGGATGTTCCTGATAGTTTCGTTCAGATAGTGAAAACTACCCCAAAGTACCCGAACACTGCTGACAGGAAGACGTTTCAGGTGAATGGAACAGCGAAGGGTGCGCCTGTAGTGCCCCCTCGCGAGGCGCCTCGTGATGCTCCGCCCAGGCCGCCCGCTCACGATCTCACTAGAGAGCAAGTTGATTCAATCAAGAAGTATCAGGACTTTAAAGATGAGAGTCGATTTGCACAACGTAATGTGTAG

Protein sequence:

>DPOGS210399-PA
MNSFNAPNQINAIDKMTTVKCKSVDLEGYVIIVVQTKDNKIKLYGSPSAGNWENLELADEIMDVNETKLEDMTRTEVLTHIHECISSCVIKLRVKRRSETKLLSDIGHNVIQDAFLIAVEEQARQRLQRLSALKRITPVDMSKLSAELNKRKRTQTESRQELNGYIANSTVYVTSINENGAIESKPTISSPKSQPKSLQAKPAPLIANGIQDKKDAVEDAVEPEKHEKDSSEKSFKVDSELKESEVLSVKSDAYNVAREHLNNGRCEKLLGEQPDHRIRATAVVENSKLGPGEKGPRRRSGSSIVVLGAEEEKLPPPDDETDMLTMLSLTTDTGPHREMAVDVPDSFIARNKTPPRYPPPRPPQVRPRARDDEPLLSSGGSGTSFGSKQSTVLQSNVDISGPSSDGSGSFKINSTMELLSLPQSSDSSGVSFGSKKSKGSAETAKCDIASERSESVDSILSHKIQLLISNGEDSLLDCKEGVTYSTRTDSVLIREAVYASYKLPPGFDTTPVLLGREVAVDVPDSFVQIVKTTPKYPNTADRKTFQVNGTAKGAPVVPPREAPRDAPPRPPAHDLTREQVDSIKKYQDFKDESRFAQRNV-