Monarch geneset OGS2.0

DPOGS203253
Transcript        DPOGS203253-TA  4299 bp
Protein           DPOGS203253-PA  1432 aa
Genomic position  DPSCF300210 + 223358-233822
RNAseq coverage   462x (Rank: top 27%)
Annotation
Heliconius      HMEL022504        0.0     66.56%
Bombyx          BGIBMGA007035-TA  0.0     65.44%
Drosophila      Hrs-PC            9e-87   60.18%
EBI UniRef50    UniRef50_G6CMC9   0.0     100.00%  Putative uncharacterized protein n=2 Tax=Coelomata RepID=G6CMC9_DANPL
NCBI RefSeq     XP_393989.3       1e-98   69.75%   PREDICTED: similar to Hepatocyte growth factor regulated tyrosine kinase substrate CG2903-PC, isoform C [Apis mellifera]
NCBI nr blastp  gi|307201531      7e-104  70.04%   Hepatocyte growth factor-regulated tyrosine kinase substrate [Harpegnathos saltator]
NCBI nr blastx  gi|345486884      6e-97   38.92%   PREDICTED: hepatocyte growth factor-regulated tyrosine kinase substrate-like [Nasonia vitripennis]
Group
Gene Ontology    GO:0006886  3.4e-39  intracellular protein transport
                 GO:0046872  4.2e-29  metal ion binding
KEGG pathway     ame:410510  4e-98
                 K12182 (HGS, HRS, VPS27)  maps-> Endocytosis, Phagosome
InterPro domain  [6-142]    IPR008942  1.6e-45  ENTH/VHS
                 [7-138]    IPR018205  2.2e-44  VHS subgroup
                 [5-138]    IPR002014  3.4e-39  VHS
                 [154-220]  IPR000306  4.2e-29  Zinc finger, FYVE-type
                 [158-224]  IPR011011  1.2e-24  Zinc finger, FYVE/PHD-type
                 [155-219]  IPR013083  1.4e-20  Zinc finger, RING/FYVE/PHD-type
                 [644-700]  IPR009675  2.3e-11  Xklp2 targeting protein
                 [382-511]  IPR022021  1.7e-06  Cell cycle regulated microtubule associated protein
Orthology group  MCL13111  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203253-TA
ATGTTTCGAGCTAATAACTTTGATAAATTATTAGATAAAGCGACAAGTAATCTACGCTTGGACCCCGATTGGCCCTCTATCTTACAAATATGCGATCTAATTAGACAGAATGACTGTTCACCAAAATATGCAGTGGCTGCTGTTAAGAAGAAGTTGTATTCACAGAACCCCTACCAAGCTATGTTTGCACTTTTGACTCTTGAGAGTATTGTTAAAAATTGTGGATCTGGAGTTCATGATGAAGTCGCATCGAAGGCGTTCTGTGAAATGCTCCGTGATCTCGTTAAAACAACCCAACATGAGAACTTGAAAACCAAGATATTGGAATTAATACAAGCTTGGGCATTTGCTTTCCGGAACTCACCAAAATATAGAGCTGTACAGGACACTGTGAATATACTGAAAGCTGAAGGTCATAAGTTTCCACCGCAAAAAGAATCTGATGCAATGTTTTCTGCGGATACAGCACCAGAATGGGCTGATGGTGAAGTTTGTCACAGGTGCAGAGTAGCATTTTCTCTTATGGTTCGTCGACATCATTGTCGTGCTTGTGGTCAAGTGTTTTGTCAGCAATGTAGCTCCAAGACATCAACATTGCCGAAGTTTGGCATTGAGAAAGAAGTCCGAGTGTGTGAGGCTTGTTATGATAAAGTAAGCCGTCCACCATCTTCAACAGCAAAGTTGGAAATTGTAGATACTTCTAGTGACTATGGACCAGCCCAACCCCAATATTTCATTGAGTTTACCATGTCTAAAAATTTTATAACAAGATATCGAGGTGAAATTTACAATACACCTACTGGAAAATTATATATCAAAGATGAACCAATCTCACCAACTGAAGAAGATTTTGGTGTTCACTATAACTTGGACTATCCCATAGAAGACAACTATGCTATGAAAAAATCACTTTCTATGAACGACATAGCTTCGCTCAGAGAAGATTTTTTAAAACTTGAGGTTACTAGTAACCAGGATGATATGTATTCTAGATACAAGAAACATTGTGAAACCCAAATGCTATCAAAAACAAAGTTTGTTTCCGTGGCCGAAGCAATATATCACTATCAGAGAGATACACCAGACAGGTTTCACTCATCCAGACCTCGCATATTCCGTCCTCAACGTAACACGGGACACACTGGACTGACTGTGCCCCAATCACCAATGTTAAGATGTAAGGCTCGTTCTCGCCCTCAACATGTACTCTCACAGAAAGAAAAGGAAGAAATGGAACTTCAAGAAATTAAGAAATTTAAAATAAAGGCTAATCCTATTCCCAAAAGTGTAATTGAAGGCCCTAAGCATCTTCCAGAAGTGTCTAAGAAGCCTATCACAGTTCCAGAACCCTTTAATCTGACTGAAATACAAAAGAAGGTCGCTCAGTCCTCTGACCATGTTCAAAATTTCAAGGCAAGACCTGCACCTAAACATATTCTAGAGAAACCACATATCCCTACTAAGCCACCATCATCTTTAACAAAACCGGTTAGCCCCAAATTTCATTACAAGCGAGCAAATTCCGCCGATCACATCAAAAATGATATTAAAATTAGTAATCCACCTGTTAATGCCAAAAAGGCAGAAAAGTGTGAAAAAACTGTACAACGTCTTGGTCCTGTGAAACCTGAACCATTTTCGTTTGAGAAGAGAGATGAAGAATTAAAGAGGAAGAGAGAAGAACGCATCAAGCAGCAGATGGAGGAGGAACGGAGGCTTGCAACTCAATTCAAGGCCCAGCCGTTGCCAGCGGCTGTGAAGAAAAGGATGCAGAATGTTCATTCTAGTCATCCATCAGATGCCTCATCTGAAAACAAAGAAAACCAAACCCATGCAAAATTTGAAGCAAAACCTCCAGTTGTTCTCTACAAGGAGCCTTTCAAACCGGTACTTAAACCAGTCCAACTGAAAAAACCGATGCCATTCGATTTAACAACAACAAAGAGAGCTGCTGAAAGAGAACTGTTTGAAAAGCAATTGAAAGAGAAAGAAGAGGAAAA
TGAAAGATTGAGACTTCAAAAGGAAAGAGAAAAACAAGAGGCTGAGGAACGAGCTATAGCTGAATTAAGAGCGAAGCTGGTACATCACGCCAAACCAGTGCCAGCATTACACCCATTCATTCCTGGAAAGTCTGATGCACCTATAACAGTTCCAGAAACACCTAAATTTAAACGACCTGGCAAGTCAGCTGAGGAGTTGCAGGAAGAAGAGGAGTTGCAACTAGCCCTGGCCCTCAGTCAGTCTGAAGCTGAACATAAGGAGAAGGAACGCAAGTCAAGATCTTACATCGCACCAGAACAAACACTACATCCTACCATTTCTCCTACTCCCTCGTCTGTGAGTGCGTCCCCTGAACACAGTACGGCTAACTCGGAACTGTCCCGCTACTTGGACAGGAATTACTGGGAACAAAGACTGTCAAGGGATCCTGCTGCACCTACAGCTCCCTCCTCGCACGCCACCAATGAAACTCCTGATCAAGATTTCACAAAACCTTCCACTAGTAAGGGACAAGAAGATGAAGCAGAGGATAAAGAGATTGACGAATTTGTCGAATCACTCAAATCACAAGTGGAGATATTTGTTAATAGAATGAAGAGTAACTCGTCACGTGGTCGTTCAATCGCCAATGACACGTCTGTGCAAACATTGTTTATGAATATAACAGCGATGCACTCGAAGTTGCTGAAATATATTCAACAGCAGGATGATAAACGAGTTTATTTAGAGAGTTTGCAGGATAAGATCAGTCAGGTGCGTGATAGTAGAGCTGCTCTTGATACTCTTCGGGCAGAGCACGCGGCTAGACTTGCCCAGGCGGCTGAGGCTGCTGAGAGACAGAGACAAATGCAAATGGCCGCCAAGTTACAGGCAATGAGGAAAAAGAAACACGAGTACTTGCAGTACCAAAGACAGTTGGCACTGCAAAGAGTTCAGGAACAAGAAAGGGAAATGCAAATGAGACAAGAGCAGCAAAAGCATCAGTATATGATGTCAGCCAACAGTGGTTTTTACATGCCTGGTGTTTCGATGCATCAATTCCAGCCTGGTTATCCCAACCAACCAATGTACGCCAGTTCTCAATTCCATCCACAAATGATGTCCCAAGCAACAACCGATGGTTCCATGCCTTTACCTGGACAACCACAAATGTCACAAGCTAATATACCCGTGAATGTGAATCAGATGCCAATGTCCCAATCTATGCCACCCATGACAAATACTTTTGCTATGTCCACATCCGGTATGGGAATTAGCCAACCTGGTGGTCAATCTGCAATGAACCAATCTAATGTAAGGCCAGTAATCAGCCAGCAATTACCTATTCAACAGCAAATGATGATGCAGCAAATGCAACAAATGAGAATGCCAAACCAACCAGGTGTGCATCAAATGATGCAGCAAAATGGCCATCCTCCCCAAAATATACAAACACAGAATCAACAGTCCTCTCAGCAAATTCAGAAACAAAATGTCCCGAATCAACCTCCCAGCTCTATGATGCCTCCAATGGCAATAGGTCAAGCAAACATAAATCAACCTAATCCTAGTAACCCAATGTTTGGGATGCAAATGCCAAATATGAGAATGTCAATGATGTCGGCCAATCCACCCAGTAATAACCAACAAGTCCCAGTTCCTGGATATCCAATGCATGTACAAAACCCGCAAAATAATCAACCAACTGGGACTCAAATGTCTAATGTGCCTCAAATGCCATTACTAGGCCAACAATTACCAATGCCAAATCAGCATATAATACAAGGGCAGCCCATTCAAGTTGGTCAACCTCAAGTACAACAAATGCCACATCAACCACAAATGCACCAAAGTCAGTCATTACCACAAGGGGGCCAATCTCAGGGACCACAAATGCCACACCAACCACAAATGACCCAAAGCCAAACGTTACCAATGCAAGGTCATCCCTCACAAACACAACCACAACAAAATATGATGATGCAGCATCCAGGTCAACCTTTGCAACAAGGTCAACAGA
TCTCTCAGACCAACACACAAGCTATGAACCAAGGCCAACAAATACCGAACCAGATGCCAGCTAATATGCCACAAATGATGCAAGGTGCTATGCAACATAGTTCACAAATGCAGGTTCCCACAATGTCTAATCAGGGTCAGCAACCAGTACAACAGAATATGCAAGGTCAAACCATGAAGCAACCATCTTTCGGTAACGGCCCTCCAAACCAACAGCCTGTCAACAACCAACAGCAAGCCCAAACCCCATTAAAGTCTGAACATAACAATAACACTGCAGAGTTGATAAGTTTTGACTGA

Protein sequence:

>DPOGS203253-PA
MFRANNFDKLLDKATSNLRLDPDWPSILQICDLIRQNDCSPKYAVAAVKKKLYSQNPYQAMFALLTLESIVKNCGSGVHDEVASKAFCEMLRDLVKTTQHENLKTKILELIQAWAFAFRNSPKYRAVQDTVNILKAEGHKFPPQKESDAMFSADTAPEWADGEVCHRCRVAFSLMVRRHHCRACGQVFCQQCSSKTSTLPKFGIEKEVRVCEACYDKVSRPPSSTAKLEIVDTSSDYGPAQPQYFIEFTMSKNFITRYRGEIYNTPTGKLYIKDEPISPTEEDFGVHYNLDYPIEDNYAMKKSLSMNDIASLREDFLKLEVTSNQDDMYSRYKKHCETQMLSKTKFVSVAEAIYHYQRDTPDRFHSSRPRIFRPQRNTGHTGLTVPQSPMLRCKARSRPQHVLSQKEKEEMELQEIKKFKIKANPIPKSVIEGPKHLPEVSKKPITVPEPFNLTEIQKKVAQSSDHVQNFKARPAPKHILEKPHIPTKPPSSLTKPVSPKFHYKRANSADHIKNDIKISNPPVNAKKAEKCEKTVQRLGPVKPEPFSFEKRDEELKRKREERIKQQMEEERRLATQFKAQPLPAAVKKRMQNVHSSHPSDASSENKENQTHAKFEAKPPVVLYKEPFKPVLKPVQLKKPMPFDLTTTKRAAERELFEKQLKEKEEENERLRLQKEREKQEAEERAIAELRAKLVHHAKPVPALHPFIPGKSDAPITVPETPKFKRPGKSAEELQEEEELQLALALSQSEAEHKEKERKSRSYIAPEQTLHPTISPTPSSVSASPEHSTANSELSRYLDRNYWEQRLSRDPAAPTAPSSHATNETPDQDFTKPSTSKGQEDEAEDKEIDEFVESLKSQVEIFVNRMKSNSSRGRSIANDTSVQTLFMNITAMHSKLLKYIQQQDDKRVYLESLQDKISQVRDSRAALDTLRAEHAARLAQAAEAAERQRQMQMAAKLQAMRKKKHEYLQYQRQLALQRVQEQEREMQMRQEQQKHQYMMSANSGFYMPGVSMHQFQPGYPNQPMYASSQFHPQMMSQATTDGSMPLPGQPQMSQANIPVNVNQMPMSQSMPPMTNTFAMSTSGMGISQPGGQSAMNQSNVRPVISQQLPIQQQMMMQQMQQMRMPNQPGVHQMMQQNGHPPQNIQTQNQQSSQQIQKQNVPNQPPSSMMPPMAIGQANINQPNPSNPMFGMQMPNMRMSMMSANPPSNNQQVPVPGYPMHVQNPQNNQPTGTQMSNVPQMPLLGQQLPMPNQHIIQGQPIQVGQPQVQQMPHQPQMHQSQSLPQGGQSQGPQMPHQPQMTQSQTLPMQGHPSQTQPQQNMMMQHPGQPLQQGQQISQTNTQAMNQGQQIPNQMPANMPQMMQGAMQHSSQMQVPTMSNQGQQPVQQNMQGQTMKQPSFGNGPPNQQPVNNQQQAQTPLKSEHNNNTAELISFD-
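
The lengths reported for this record are mutually consistent: 3 x (1432 + 1) = 4299, which is what a transcript consisting only of coding sequence plus one stop codon would give. The sketch below shows one way such a check could be scripted. It assumes that the trailing "-" in the protein sequence marks the stop codon and that sequence lines may be wrapped. The helper names read_fasta and cds_matches_protein are hypothetical and not part of any database tooling, and the toy record reuses only the first four codons and the final TGA of DPOGS203253-TA.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    parts = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            parts[header] = []
        elif header is not None:
            parts[header].append(line)
    return {name: "".join(chunks) for name, chunks in parts.items()}


def cds_matches_protein(cds, protein):
    """Check that a CDS is 3 nt per residue plus one terminal stop codon.

    Assumption: a trailing '-' or '*' on the protein marks the stop codon.
    """
    residues = protein.rstrip("-*")
    return (
        len(cds) == 3 * (len(residues) + 1)
        and cds[-3:].upper() in STOP_CODONS
    )


# Toy record: ATG TTT CGA GCT TGA, wrapped over two lines on purpose.
toy = """>toy-TA
ATGTTTCG
AGCTTGA

>toy-PA
MFRA-
"""

records = read_fasta(toy)
print(cds_matches_protein(records["toy-TA"], records["toy-PA"]))
```

Applied to the full record, the same two calls would take the text under "Nucleotide sequence:" and "Protein sequence:" as input. A mismatch would suggest UTRs in the transcript or a truncated sequence rather than an error in the check itself.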