Monarch geneset OGS2.0

DPOGS213569
Transcript        DPOGS213569-TA  1470 bp
Protein           DPOGS213569-PA  489 aa
Genomic position  DPSCF300033 + 25810-32060
RNAseq coverage   1051x (Rank: top 12%)
Annotation
Heliconius        HMEL010863  5e-144  78.29%
Bombyx            BGIBMGA011641-TA  6e-102  83.03%
Drosophila        CG3529-PB  7e-110  45.21%
EBI UniRef50      UniRef50_UPI00022C9573  3e-127  51.84%  UPI00022C9573 related cluster n=3 Tax=unknown RepID=UPI00022C9573
NCBI RefSeq       XP_001122551.1  1e-129  51.15%  PREDICTED: similar to CG3529-PB [Apis mellifera]
NCBI nr blastp    gi|270004201  3e-132  51.98%  hypothetical protein TcasGA2_TC003525 [Tribolium castaneum]
NCBI nr blastx    gi|307213014  4e-131  51.73%  TOM1-like protein 2 [Harpegnathos saltator]
Group
Gene Ontology     GO:0006886  9.8e-44  intracellular protein transport
                  GO:0005622  1.2e-17  intracellular
KEGG pathway      mdo:100026791  9e-22
                  K04705 (STAM) maps-> Endocytosis
                                       Jak-STAT signaling pathway
InterPro domain   [1-489]    IPR014645  2.4e-125  Target of Myb protein 1
                  [14-150]   IPR018205  9.1e-61   VHS subgroup
                  [6-155]    IPR008942  6.5e-59   ENTH/VHS
                  [9-150]    IPR002014  9.8e-44   VHS
                  [207-302]  IPR004152  1.2e-17   GAT
Orthology group   MCL12427  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213569-TA
ATGTCGTTTTTTGGAGTAGGGAACCCATTTGCTACGCCGGTTGGTCAGAAGATAGAACAAGCTACCGATGGATCTCTACCATCAGAGAATTGGGCCCTTAATATGGAGATATGCGATATAATAAATAGTAGCACTGACGGACCCAAGGATGCAATTAAGGCCATAAGAAAGAGGTTGACTCAAAGTGCAGGCAAAAACTACACCGTGGTCATGTATACACTAACTGTATTGGAAACCTGTGTTAAAAATTGCAGCAAAACATTTCATGTTCTAGTATGCAATAAGGAGTTTATATCAGAACTGGTCAAACTCATTGGACCCAAAAATGATCCACCGACTGTGGTACAGGAGAAGGTTCTCAGCCTCATTCAATGTTGGGCGGATGCTTTCCAAAACCAACCTGAATTACAGGGCGTTGGTCAAGTGTATAATGAATTGCGCAACAAAGGTGTCGAGTTTCCAATGACAGACCTAGATGCCATGGCGCCAATTTTCACACCGCAGAGGAGTGTGATTGACGGCGGGGAGCCGGTAGTTGGTTCTCCACAACGTACTATTCAGCAGAACTCTCCAAGTCGACCATCTCAGGAGCAAGTCGTAGGGACAATTCTGTCAGACAGTCAGAGCAGTAAGCTGCGGGCGGACCTGTCAGTGGTTGAGGGGAACATGACAGTCATGAACGACATGCTGACCGAACTCACAAGCCTTCCATACACACAGCATCACGAACAAGACATCGAACTGCTGAATGAGCTAGCAGACACCCTGAAGGCGATGCAGACTCGTGTCGCTGAGCTGGTGGGTCGTTTAGGGGAATCCCCGCTGACGGCCGACCTTCTGCTGACCAATGACCGCCTCCACAACCTGTTACTGAGACACTCCAGGTTCATCAATAACAGAATTGCCGCGACTGGTGGGGCGACGCCATCCGCCATTTTGGGCGCCGCCATGGGTGTGCCGGGCGCCACTTCACCTGAGAAAAAAGATGACGACGCTTTAATTGATCTCAGTGATGACGTACCCGATGTTGCTAAACTATCTGTTAAAGACGATACAATCGATAAATCACCAAGCAGTTCCAAGGATGAGTTCGATATGTTTGCTCAGTCTAGGAACGTTACCTATGAGACCACTAAGACGGGCGGCAGTAGCTATGCTGATAATGCTGAAGCTCCCGTCGGAGGTCTAGGAACCGCTATGAGGGCTCATAACACACTGCCGACAGGAATGGATCAAAGGGAATTTGATGAAATATCGGCTTGGTTGGCCCAAGAGAAGGCAGCGACTGAAGCTAACGGACAGGAGAGTGTGACGAGCAGCGATTTTGACAAGTTCCTCGCCGAGAGGGCCGCGGCCGCAGACAGCCTGCCGAATGCTGGCCAGGGACAAGGTCAGGGTCAGGGTCAGGGTCAAGCGACGCCCCGCCACCGCCACATCAAGAAGGACGAGGACTCCATGTTCGCGCTATGA

Protein sequence:

>DPOGS213569-PA
MSFFGVGNPFATPVGQKIEQATDGSLPSENWALNMEICDIINSSTDGPKDAIKAIRKRLTQSAGKNYTVVMYTLTVLETCVKNCSKTFHVLVCNKEFISELVKLIGPKNDPPTVVQEKVLSLIQCWADAFQNQPELQGVGQVYNELRNKGVEFPMTDLDAMAPIFTPQRSVIDGGEPVVGSPQRTIQQNSPSRPSQEQVVGTILSDSQSSKLRADLSVVEGNMTVMNDMLTELTSLPYTQHHEQDIELLNELADTLKAMQTRVAELVGRLGESPLTADLLLTNDRLHNLLLRHSRFINNRIAATGGATPSAILGAAMGVPGATSPEKKDDDALIDLSDDVPDVAKLSVKDDTIDKSPSSSKDEFDMFAQSRNVTYETTKTGGSSYADNAEAPVGGLGTAMRAHNTLPTGMDQREFDEISAWLAQEKAATEANGQESVTSSDFDKFLAERAAAADSLPNAGQGQGQGQGQGQATPRHRHIKKDEDSMFAL-
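
The transcript and protein lengths given in this record can be cross-checked against each other: a coding sequence that encodes N amino acids and ends in a stop codon spans 3 x (N + 1) nucleotides. The following is a minimal Python sketch of that check, assuming the transcript is CDS-only (no UTRs) and includes the terminal stop codon. The function name is hypothetical and is not part of any geneset tooling.

```python
def cds_length_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if a CDS of transcript_bp nucleotides can encode
    protein_aa residues followed by exactly one stop codon.

    Assumption: the transcript holds only coding sequence (no UTRs).
    """
    return transcript_bp == 3 * (protein_aa + 1)


# Lengths taken from the DPOGS213569 record: 1470 bp transcript, 489 aa protein.
print(cds_length_consistent(1470, 489))
```

For this record the check holds, since 3 x (489 + 1) = 1470. The trailing "-" in the protein sequence marks the stop codon position.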