Monarch geneset OGS2.0

DPOGS208952
Transcript: DPOGS208952-TA, 1182 bp
Protein: DPOGS208952-PA, 393 aa
Genomic position: DPSCF300009 + 417844-420930
RNAseq coverage: 266x (Rank: top 40%)
Annotation
Heliconius: HMEL011557, 0.0, 79.39%
Bombyx: BGIBMGA002424-TA, 3e-174, 76.86%
Drosophila: CG2224-PA, 5e-102, 48.40%
EBI UniRef50: UniRef50_E0VIA9, 2e-108, 53.66%, Predicted protein n=1 Tax=Pediculus humanus corporis RepID=E0VIA9_PEDHC
NCBI RefSeq: XP_001844659.1, 3e-111, 52.39%, amsh [Culex quinquefasciatus]
NCBI nr blastp: gi|170033589, 5e-110, 52.39%, amsh [Culex quinquefasciatus]
NCBI nr blastx: gi|170033589, 8e-111, 53.96%, amsh [Culex quinquefasciatus]
Group
Gene Ontology: GO:0005515, 1.5e-18, protein binding
KEGG pathway: cqu:CpipJ_CPIJ002590, 8e-111
    K11866 (STAMBP, AMSH), maps-> Endocytosis
InterPro domain: [233-338] IPR000555, 1.5e-18, Mov34/MPN/PAD-1
    [43-118] IPR015063, 9e-08, Domain of unknown function DUF1873
Orthology group: MCL13363, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208952-TA
ATGCAGTCAATTGAAAAAAAGGGTAAAACAGTGGATCTAGCATCCTTAGAACCAGCAGCTAGGGTTAAGCAGTTGGCGAACTATGGGGCAATGGTAGATGTTGATCCAAACGTTCCTCCTAGAAGGTATTATCGATCTGGATTAGAAATGGTGAGGATGGCAAATGTGTATCTAGCTGAGGGCAGCTTAGAAAACGCTTATATATTATACATGAAGTTTATGACCCTGTTTGTAGAGAAGATTCGGAAACACCCAGAGTACAACACTGTGCCATCAGAAGTTAAAGCGGTCAATCAGAGTAAACTAAAAGAAGTTATGCCAAAGGCAGAAAAGTTAAAGCAAAAGTTGTTAGATGTCTATGCTAAAGAACACACTCTATATATTGAAAATGAGGCAAAGAGAAAAATAGCAGAGGAGGCAAGAAGAAAGCAAGAGCAAGAAGATGCGAAAGTCGCTCAAAGACTTCAAGCCGATGAGAACAGACAAGATGGTCACAGCACTACACCATATTTATTACATGCGGATCAATGGGCTGTGACACCAACAGCACCTCCGGTGGATGATGTGTTGTATCCGGATGACTTTGCAGATCCTCCTCGTTCATTGCCAGGTGTTCCATCATCTGTACCCCCCGCCATCATTCCACCGTCAAGACCTGCCGTTGATTCAAGTGGACTGTTGGATGCGAGGCGTCTTCGTACGGTTGTGATACCCACGGCACTGTTGCCAAGGTTTTTATCACTGGCTGCCCAAAACACAGCCGCAAACAAAGAAACTTGCGGCATACTAGCTGGTAGACTGGAACAAAATCAATTGAAGATCACGCATGTGGTGGTGCCCAAACAGACGGGAACATCAGACTCGTGTAGTACAAACAACGAGGAAGACATCTTTGAATACCAGGACAAACACAATCTCATCACCTTGGGATGGATACATACCCATCCGACCCAGACGGCATTCCTTTCATCTGTGGATCTTCACACACAGTGCTCCTACCAGCTCATGATGCCGGAAGCCATTGCTATTGTCTGCGCACCAAAATATCAAGAGACCGGTTACTTCGCTCTAACTCAGGACCACGGTATGTCGTTCATAGCCAAATGTCGTCAGCCTGGGTTCCATCCACATCCATCAGATCCACCGCTGTTCTACGTATGTATATGTTATGTTCCTAAATAG

Protein sequence:

>DPOGS208952-PA
MQSIEKKGKTVDLASLEPAARVKQLANYGAMVDVDPNVPPRRYYRSGLEMVRMANVYLAEGSLENAYILYMKFMTLFVEKIRKHPEYNTVPSEVKAVNQSKLKEVMPKAEKLKQKLLDVYAKEHTLYIENEAKRKIAEEARRKQEQEDAKVAQRLQADENRQDGHSTTPYLLHADQWAVTPTAPPVDDVLYPDDFADPPRSLPGVPSSVPPAIIPPSRPAVDSSGLLDARRLRTVVIPTALLPRFLSLAAQNTAANKETCGILAGRLEQNQLKITHVVVPKQTGTSDSCSTNNEEDIFEYQDKHNLITLGWIHTHPTQTAFLSSVDLHTQCSYQLMMPEAIAIVCAPKYQETGYFALTQDHGMSFIAKCRQPGFHPHPSDPPLFYVCICYVPK-
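
The transcript and protein records above can be cross-checked against each other: 1182 bp is 394 codons, that is, 393 amino acids plus one stop codon, which the protein record writes as a trailing "-". The following is a minimal Python sketch of that check. It assumes the standard genetic code and a coding sequence that starts in frame at position 1. The function name `translate` and the `stop_symbol` parameter are illustrative and not part of the database. Only the first nine and last four codons of DPOGS208952-TA are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, codon by codon.

    Stop codons are written as `stop_symbol`; "-" is assumed here
    because the protein record above ends with that character.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length bookkeeping taken from the record: 1182 bp transcript, 393 aa protein.
transcript_bp = 1182
codons = transcript_bp // 3       # 394 codons
protein_aa = codons - 1           # 393 residues once the stop codon is excluded

# First 27 nt and last 12 nt of DPOGS208952-TA, copied from the record above.
first_fragment = "ATGCAGTCAATTGAAAAAAAGGGTAAA"
last_fragment = "GTTCCTAAATAG"

print(codons, protein_aa)
print(translate(first_fragment))
print(translate(last_fragment))
```

Running the same function on the full nucleotide sequence should reproduce the full protein sequence, provided the assumptions above hold for this gene model.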