Monarch geneset OGS2.0

DPOGS215544
Transcript: DPOGS215544-TA (3375 bp)
Protein: DPOGS215544-PA (1124 aa)
Genomic position: DPSCF300129 - 135326-170411
RNAseq coverage: 1042x (Rank: top 12%)
Annotation
Heliconius: HMEL003915  0.0  68.31%
Bombyx: BGIBMGA002300-TA  0.0  71.10%
Drosophila: CG6051-PC  1e-104  58.39%
EBI UniRef50: UniRef50_D6WZX8  5e-120  59.84%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WZX8_TRICA
NCBI RefSeq: XP_974964.1  1e-120  59.84%  PREDICTED: similar to CG6051 CG6051-PB [Tribolium castaneum]
NCBI nr blastp: gi|91091004  2e-119  59.84%  PREDICTED: similar to CG6051 CG6051-PB [Tribolium castaneum]
NCBI nr blastx: gi|242018168  3e-159  35.45%  zinc finger protein FYVE domain containing protein, putative [Pediculus humanus corporis]
Group
Gene Ontology: GO:0046872  2.6e-28  metal ion binding
KEGG pathway: cbr:CBG01567  8e-16
    K12182 (HGS, HRS, VPS27) maps-> Endocytosis
                                    Phagosome
InterPro domain: [1050-1119] IPR000306  2.6e-28  Zinc finger, FYVE-type
                 [1057-1116] IPR011011  7.8e-23  Zinc finger, FYVE/PHD-type
                 [1050-1116] IPR013083  2.5e-22  Zinc finger, RING/FYVE/PHD-type
Orthology group: MCL14490  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215544-TA
ATGAGGCTGTTATACAAAAGTGGACCAATGGAGTCTCTCAGAAAATGGTTCTACAAACCAAAGAGGGATGACGGTTCTCTACTAGCGCAGTTCTTCTTCGCTGATGACGCGCTTAATATGATAGCTGCTGAGTTGGACTCTTTCGATGGTCGAAAGGACCCAGAGAGGTGTTCGACACTCGTGAACCAACTACGTCATGCACAGAGGGATGACGGTTCTCTACTAGCGCAGTTCTTCTTCGCTGATGACGCGCTTAATATGATAGCTGCTGAGTTGGACTCTTTCGATGGTCGAAAGGACCCAGAGAGGTGTTCGACACTCGTGAACCAACTACGTCATGCACAGGATAGAGTTCTCAACATTACTAGTCAGATAATGGACCAGGTTCTCGGAGATGAGAGAGTCGCTCGCGGCTTCCGAGTTAAATTCCCGGAGGACGTGCTGCAGGATAATCTGGCTGGTCAGCTTTGGTTTGGGGCCGAGTGCCTGGCAGCTGGGTCGTCTATAATGAATCGGGAGGAAGAATCAGCCGCTATGAGACCGTTAGCTAAGGCTTTGACCAGAAGCCTGGAGACCGTGAGGTCTTTGCTCAGAGAACAGTGTCTCCGCCCACGTGGTCTGGCCCTACAAGATCATGACGACATGCTCCACGAGAGTTTGAGGATATTTGACAGATTGTTCGCTGAATTCGAGCTATGTTACGTGAGCGCGATGGTAAATGTCAAGACTCCACACGAATTTGAAGCCCAGCAGTTGATATGTGTCCTGTTCTCTGAGAGTCTAGGACGAGCTCTCAAACAAAGTCTGCTTACACAGGAACAGGTGGATTCATATGATCCAGCGCTAATGTTCGCCGTTCCACGTCTCGCTATAGTCAGCGGCCTCTTGATATATTCCAGCGGACCTCTCAGTATAGACAAACCACCTGAAGAAATGTCCGACATGTTCCGTCCCTTTCGCACCCTGCTTCACAAAATTCGCTCTCTTCTTTGGACTTTGGATCGGCGGGAGCTACTTGTGCTTGAGCGATTGTTATGTACCAACGAAGACGTAGCCAGTCTCGCCGGGCTTGATATACCCGCTGATGATTCCACAGCTGATTCATGTTATCCTGATATTGGCGAGTTCGTGTCTAAATTCTACGCGGATAACGTGCACTGCCGGGATCTATACACCCAGAGTTCAAGCGAACATTTGATAACAGACGCCGATTACCTTCCCGACAACATTGAGGTTATGACTCCAATTATTGAAGGACTAAAACGTCTCACAGAGGATATCGATGAATCTGAACGAGAATCCATCGATATCGACAATAAACTCACGGATAAGGACAGCACTAGCACTGATACATTCCAGTCGACAGAGAGCAAAGAATTTGATAAAATGCGGAAGATTAGTAGGGCGGAAAGTGATTTATCGTCGCTGAGAAATCTATCGACGAATGGTCTCATGATGTTCGATCCATTGGTTATAAATGCGGTGTCAGCATCGACATCAGAACGACAAATCCCGGACCAAGATTTAGTCACATCACTAAACGAGCAAGTAACTGAAATAGCGGATAGATTATCCTCTATAGTTAGTGATGTGGATATACACGATCACCAAATACACTCGTTCTCAACAAACGATGTCCCATCCTTGATATCGAATGAGAATTCTGAGGATTACGGCTTTAGTGGTCCAAGGAACGAACAGTTGAGTTGTATGCCATCCACTAGTCATGGTTATTTGATACCAAACGCTATTAGTCAGGAGCCCGTAGTACTAGCCTCACTCTCACACAATAACTCCGCTGTGTTCAACCAGGACGAAAGCACTGATATTAACGGCGCAATACTGAACTCTGATCTCCCATTAATACTGACCGATAATATTGACGCTGATATTGTAAATACTAATATATCAATAGCAAACGTGAATCTTAGTTCACTGTTGTTGACGAACGAGGAATTCAGACAGAATTTCATTGGAAACGAAAATGACGGTGACAATAC
AAGCTTCCATTCAGCAAAAATGACGATGGATAAAGATGAAAGGGTCAAATTTATGCTCGGCTACGACAGCGAGCACGAGTCACCAGCCGATTCTGGTGTCAGTACTGAAAATACAAGCCTAGACAGATCGCCTGACACGGACCAGAATAAAGATATCAAATCAAACTATTTCATGCAACAGAACAGGGTATTAAAAAGTCCAATAGACGAGCGGTTACAGAAAGAAAACGTTCTAGAAAGTTCCTTCTCTGATGTAAGAGACGATGTTAATATAAATTACGTTGAAGATGACACACAGAAGAATGAGGCGTCGTCTAGCGTTATTGATAGGTTGAGTGACGACGGCTTGAACGAAGCTACTAATGTTAATGTACAGAACTGGACCACAAATGAACCGCAAGACGAAATAAGGACCATAGAGGATGTTATAACAGGCCTTAGAAGACACAGAGAAGCCGATAAGAAGAATACGTTAGAGGGAACGAACGAACCGAACTGTTCCACGAGCACCAGGCGGAAAAGGAAATCGAGCACCAAGAAGAAAAAGAAAAAGATGTGGTCGCCCGGCATCGTTATACTCGATAACAGATCAAACCAAAACCAAAGTCCCTTAAAGTTGAACCTGGATCACTTTGTCGAGAATACATCTTGCAGTACGTCCGAGTGCGCTGACGACGAGCAGATCGCCCTCGCGCTTCAAGCACAGGAACTAGCAGCAAGAAGACGGGCCAGGGATAAATTCAAGTCATCCGAGGATCTGATACATCGTCTGTTCGTGTGTATAGCCGGGGTGGCTGATCAGTTGCAGACTAACTTCGCAGCGGATCTACGTAACATATTGAAGGCTGTTTTCCTCATTAACCAGACGGCGGATGTGCCAGAGAGCATAGAGTACCAGGCCGGGGAGGATCAGGTCATACAAAACGGAAGTTCCTTCGACAGCGTTTATTCAGCGGAGGAAGTGTACGCGGACAGTAACTCCGACTCTACATCAGAGCCATCTAGAGTAGCAAGACGTAACACAGTCAACTCTACCATGAACGAGCCTAACGTTAGCGATAGACGTAAATCAGCGGACACCATGCAAGCCTCTGTGTCAACTGGAGATCTTACATATAGGGATGACACGTCTTCGTCGTCAGTCGTGGAGCGAGCGCCCGAGTGGGTCCCTGATATAGCGGCGCCGGCCTGTATGAGATGCTCCTCACACTTCACAGCCTTCAGGCGTAGACATCACTGTAGGAACTGTGGTAAAGTATTCTGTGCTTCGTGCAGTTCGAATTCAATACCATTACCGAGGTTTGGTCAGTTGAAGCCGGTGCGTGTGTGTGAGGAGTGCTACCAGACCAACTGTGGGAGACAGACGAATCGGTGA

Protein sequence:

>DPOGS215544-PA
MRLLYKSGPMESLRKWFYKPKRDDGSLLAQFFFADDALNMIAAELDSFDGRKDPERCSTLVNQLRHAQRDDGSLLAQFFFADDALNMIAAELDSFDGRKDPERCSTLVNQLRHAQDRVLNITSQIMDQVLGDERVARGFRVKFPEDVLQDNLAGQLWFGAECLAAGSSIMNREEESAAMRPLAKALTRSLETVRSLLREQCLRPRGLALQDHDDMLHESLRIFDRLFAEFELCYVSAMVNVKTPHEFEAQQLICVLFSESLGRALKQSLLTQEQVDSYDPALMFAVPRLAIVSGLLIYSSGPLSIDKPPEEMSDMFRPFRTLLHKIRSLLWTLDRRELLVLERLLCTNEDVASLAGLDIPADDSTADSCYPDIGEFVSKFYADNVHCRDLYTQSSSEHLITDADYLPDNIEVMTPIIEGLKRLTEDIDESERESIDIDNKLTDKDSTSTDTFQSTESKEFDKMRKISRAESDLSSLRNLSTNGLMMFDPLVINAVSASTSERQIPDQDLVTSLNEQVTEIADRLSSIVSDVDIHDHQIHSFSTNDVPSLISNENSEDYGFSGPRNEQLSCMPSTSHGYLIPNAISQEPVVLASLSHNNSAVFNQDESTDINGAILNSDLPLILTDNIDADIVNTNISIANVNLSSLLLTNEEFRQNFIGNENDGDNTSFHSAKMTMDKDERVKFMLGYDSEHESPADSGVSTENTSLDRSPDTDQNKDIKSNYFMQQNRVLKSPIDERLQKENVLESSFSDVRDDVNINYVEDDTQKNEASSSVIDRLSDDGLNEATNVNVQNWTTNEPQDEIRTIEDVITGLRRHREADKKNTLEGTNEPNCSTSTRRKRKSSTKKKKKKMWSPGIVILDNRSNQNQSPLKLNLDHFVENTSCSTSECADDEQIALALQAQELAARRRARDKFKSSEDLIHRLFVCIAGVADQLQTNFAADLRNILKAVFLINQTADVPESIEYQAGEDQVIQNGSSFDSVYSAEEVYADSNSDSTSEPSRVARRNTVNSTMNEPNVSDRRKSADTMQASVSTGDLTYRDDTSSSSVVERAPEWVPDIAAPACMRCSSHFTAFRRRHHCRNCGKVFCASCSSNSIPLPRFGQLKPVRVCEECYQTNCGRQTNR-
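
Length check:

The transcript and protein lengths listed in this record can be checked against each other. The nucleotide sequence starts with ATG and ends with TGA, and the protein listing ends with "-", so the transcript appears to be a complete coding sequence with one stop codon. The sketch below is only an illustration under that assumption; the helper name expected_cds_length is hypothetical and not part of any MonarchBase tool.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 3375
PROTEIN_AA = 1124


def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Return the coding-sequence length (bp) implied by a protein length.

    Hypothetical helper: assumes the transcript is pure CDS (no UTRs)
    and that one stop codon follows the last amino acid when has_stop is True.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return codons * 3


# (1124 residues + 1 stop codon) * 3 bases per codon
print(expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP)
```

Under these assumptions, 1124 residues plus one stop codon give 1125 codons, or 3375 bp, which matches the listed transcript length.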