Monarch geneset OGS2.0

DPOGS207340
TranscriptDPOGS207340-TA3477 bp
ProteinDPOGS207340-PA1158 aa
Genomic positionDPSCF300188 + 126966-135144
RNAseq coverage208x (Rank: top 46%)
Annotation
HeliconiusHMEL0022040.082.15% 
BombyxBGIBMGA010268-TA0.074.91% 
DrosophilaSIDL-PA0.037.61% 
EBI UniRef50UniRef50_E2B8E70.046.41%Trafficking protein particle complex subunit 10 n=9 Tax=Formicidae RepID=E2B8E7_HARSA
NCBI RefSeqXP_623870.10.047.12%PREDICTED: similar to CG6623-PA [Apis mellifera]
NCBI nr blastpgi|3227854040.046.80%hypothetical protein SINV_09205 [Solenopsis invicta]
NCBI nr blastxgi|3071881130.046.77%Trafficking protein particle complex subunit 10 [Camponotus floridanus]
Group
KEGG pathway 
InterPro domain[991-1148] IPR0222333e-22Trafficking protein particle complex subunit 10
[454-542] IPR0217737.3e-07Foie gras liver health family 1
Orthology groupMCL11875 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207340-TA
ATGAAGGGGGTCGATAACTCATTATCGGATCACGAAGGCATGACACACAAACCCATTATAACATGTGCAGGAGATTGGAATTTATTTTGCACACTGGAACAGCCGCTGGTAGCTGCAATACCGCAAGATTCTTGTGAATGGCGAAGGTCCTACGGCCGTATAACTAAACTAGTGTCTTTAGAGGCATCTTTCATAAAGTTTAATAAGGACAAATTGAAAACAGAGCTCAACCTTTTGAACCGACCTATTTTTCACATTTATTGGACTGATTGTGTGGATATAGAATATTACAAAACCACATTGAGGGAAGATATAGAAATATGGCTGAAACAGTTAGAAAAACACAATGTTACAGACTGGATGATAGTATTAGTAGAAACTTATGATATAAGAAAAACAAATAAACTTCTTCCAAGAACTACAGTTTTAGATAAAATTAAGGGAGATTTTGCTGTGAAACAGACAGAAGACAGGTTTATTTCGGTTATCAATCCTATAAAATCTGAGGCTAGGAGTGCGGACTCATGGAGAACATTAGTGGCGAAAGTGAGGCATCTGGTTTTAGTGGCATACAATAAGGCTCTAATAAAATTTGAAGAGCATATGAGGGAACAGAGAGAAAGCAGGAATGATCCGGAATGGGATTTTTGTAAATATTTTATATTACAGGAGCAATTGGCGTTTGTTCTAGAGATGCTCGGTTTGTATGAAGAAGCTCTAGTCCAATATGACGAATTAGATGCTTTGTTTTCTCAATTTGTATTAAATTCAAACGTGACGGAGAGTCCAAAATGGCTTGAGACATTCAAACAGCCAATAACATCATGGCAGGCGGTGAGATTGACGGCCCTAGTGCCGCAGAATTTAAGAGAATTAATAATTAAGAATAAAGCATCTCTATTAGATTTTAGGAGTTATTTATTTCAACGACAAAGTGCCATGTTATTACCTACTTTTAAGCCATGGGAGATAGCATCCCGATGTCTCACCACGGTCCATAATACACTGGTTGAGGTGTCTCTGCTAGGAGCACATGCTGTGGATGGAGCGGCGGCTTGCTGGGCTCACTTGGCCTGTGTGGAGACATTAAGAGCATGTGAAAGATTGTCCTCCACCAACAGTGCGCTGGAGGCGTGCACAGCAATGCATGCACCTCTGCTGCATAATGCTAAAGATAAGTTACATGAATTGGGCAAGTTGTGTGGACTCCTCCCAGGATGTCCAGACCCTACATCAGAACAACTCCACTTGGTGGTGATGCTGTCAGCTGGTATGGGAGATAGCGAACCGAACCAACAAACACCAACAGACAGATTGAAAGAAGCTTTATGCAGCAAGATCTGCTTCCAGAATTATTATCTAGAATTGGCTGAGTTGGCTATGGGGACATACAAGCACATTGGAAGACTTAGATTCGCCCGTCAGATCGGAAGAGACCTGGCCTCGTTCTACTCGGAACTGGGTGAGAGCAGTAAAGCTGTGGTGTTCCTGACGGAGGCGCTTCGGTCTTATGAAGAACAGGGCTGGAGAGATCTGGCAGCGCAGACGAGGCTCGAGCTGGTGGCTGCGGCTTGCAAGATGAAGGATAGAGATAGATATACAAAGCTCTCAGCTAGAATAGCCAGCACAGCGGAATTAGAAATTCTAGTACGGAATTTCTATTTTGAGGAAATGATGAAATCTATAAAGGAAACTGATAAACAGGAATCAGTGTTAACCGAGCTCAATGACTGCTTTAAAATAGTATCAGTTAATATACTGCCGTCCGAACGTGGTGTCTACATCACAGATAATAAGGTTCAATGTCGCTTGGTGATCGAAAGCTTGATGCCCAAAGATGTTTTATGCAACAAAGCGGCTATATGTGTTGATAGTGTGAAGCAAGATAAGACTCCAGTGAAGACTTATAAGACTGATCTAACTGTAGGCAAACAATCTAGTTCACCGAGGAAATCTAATATTGATAATACTGATAGTGATTCCAATAATTTAGTTACTAGTATAAGACTAGAAGATTTAAAAGCAAAAAATTCATTCCTTAACAAAATGAACATAACGTCTAAATTGCATTACAAAGAAGACAGGACACTACAGAAGGCTACCGTCGAATGCTCTCACCCAAAAGTAACTTTAAGGAGATCAGACAGCAGTAAATATAGAAAACCGTCTGCTACAATACGGAATAACTACGAGACGTGTTTAGCAACTGACCATATTATTTTGAAACCCGGTTTAAATGAAATATTACTGGAATATGTACCAAAGATGTGTGGTTTATTCAAATTGGGACAAGTTTCGTTGTTAATTGAGGGTAGACTTGAATTTCTATCGAATGCATTGATACAGTGCAAGCTTGGTTATGACGTGGAGACGAGAGGTGTCAGCGTGTACTTGAACAAAGTCGAACCAAAAAAGGATTTGGTTGCCGGCTTAGAGGAAGATGTTGAATTGGTTGTGACCAGCGGCAGTTCTAGAATAGAAGAGAATTCAATAATTCAGTTAAAAACATCGACCGGACTCCAAATACGTTTCACAGATTCGAATCTGTCAAGAGAGTTGTCTATGCCAATAGAGTCTATAGAGCCGTTCCAAACGACCAAAGTAGGGCTCAAGTTGTTTGCTAATCTTCAACCTAGAAGGGAAAAAAGTATAGAACATACTGTTTGGCTCCACTGTCCGTGGTGGGAGACCGTGACGGAGGTGCCCTTACACTTTACACCGCCCATGATAGCCTCCTGGAGGTTACTGACTTCCAACACCAGGAAGTTCATTCATATCACCCTCAAATCAACCATCGTGCATCTCGCTCAGTTCGTGCTGAGTGATCCTGTGCTAGAGTGTGACAATGATAATACTGTGGCGGATTTGAATCCAAAGAACGCTGGGGATATGATAGTAGCGTCCGATGGCACCACCAGCTCGTTCATGTGGGAGCTGCTTAAGGATCCTCTGGTGAAGGCTGGGCCGATGAAGGCGGTGTTCAAGGTCAACTATAGATTACTTGAAGAAGATATATCCAGACAATTTACTTGCCCTTTTGATATACAAGACTATACCACTCTTTTTGTTGTGAGAACTAAGTTGGAGCCATCCAAGGGTTCTGACTTCTGTAGAGCTTCACAAGTCTGCTGTTTACAGTTGACTGTTCAAAGGGTAAATGAAACAGAGCACACTTCTCTAATGTATGAAGTGCTTGCGGATCAAACCATGTGGGCGGTGCTGGGACGAACTGCGGGTGTTATAACAATGGAGTCCAATTCTGAAGGTCAATGCGTGAACCTGGATGTGATGCCACTGGTGGCTGGATACCTTCCACTACCAGCTGTCAGGTTGTCGAAATACATCGCTGCTAACACTAGAGACCCTTCCTCCCATCCAAGATTGGAGCCGTTCAGTCCTGGCCAGGTGTACCACGCGGGGAAGGCAAGACAGTTACACGTTCTACCACCCCTCACCAAAGAACATGATAATATCTGA

Protein sequence:

>DPOGS207340-PA
MKGVDNSLSDHEGMTHKPIITCAGDWNLFCTLEQPLVAAIPQDSCEWRRSYGRITKLVSLEASFIKFNKDKLKTELNLLNRPIFHIYWTDCVDIEYYKTTLREDIEIWLKQLEKHNVTDWMIVLVETYDIRKTNKLLPRTTVLDKIKGDFAVKQTEDRFISVINPIKSEARSADSWRTLVAKVRHLVLVAYNKALIKFEEHMREQRESRNDPEWDFCKYFILQEQLAFVLEMLGLYEEALVQYDELDALFSQFVLNSNVTESPKWLETFKQPITSWQAVRLTALVPQNLRELIIKNKASLLDFRSYLFQRQSAMLLPTFKPWEIASRCLTTVHNTLVEVSLLGAHAVDGAAACWAHLACVETLRACERLSSTNSALEACTAMHAPLLHNAKDKLHELGKLCGLLPGCPDPTSEQLHLVVMLSAGMGDSEPNQQTPTDRLKEALCSKICFQNYYLELAELAMGTYKHIGRLRFARQIGRDLASFYSELGESSKAVVFLTEALRSYEEQGWRDLAAQTRLELVAAACKMKDRDRYTKLSARIASTAELEILVRNFYFEEMMKSIKETDKQESVLTELNDCFKIVSVNILPSERGVYITDNKVQCRLVIESLMPKDVLCNKAAICVDSVKQDKTPVKTYKTDLTVGKQSSSPRKSNIDNTDSDSNNLVTSIRLEDLKAKNSFLNKMNITSKLHYKEDRTLQKATVECSHPKVTLRRSDSSKYRKPSATIRNNYETCLATDHIILKPGLNEILLEYVPKMCGLFKLGQVSLLIEGRLEFLSNALIQCKLGYDVETRGVSVYLNKVEPKKDLVAGLEEDVELVVTSGSSRIEENSIIQLKTSTGLQIRFTDSNLSRELSMPIESIEPFQTTKVGLKLFANLQPRREKSIEHTVWLHCPWWETVTEVPLHFTPPMIASWRLLTSNTRKFIHITLKSTIVHLAQFVLSDPVLECDNDNTVADLNPKNAGDMIVASDGTTSSFMWELLKDPLVKAGPMKAVFKVNYRLLEEDISRQFTCPFDIQDYTTLFVVRTKLEPSKGSDFCRASQVCCLQLTVQRVNETEHTSLMYEVLADQTMWAVLGRTAGVITMESNSEGQCVNLDVMPLVAGYLPLPAVRLSKYIAANTRDPSSHPRLEPFSPGQVYHAGKARQLHVLPPLTKEHDNI-