Monarch geneset OGS2.0

DPOGS212823
TranscriptDPOGS212823-TA1368 bp
ProteinDPOGS212823-PA455 aa
Genomic positionDPSCF300086 - 287429-288796
RNAseq coverage543x (Rank: top 23%)
Annotation
HeliconiusHMEL0101180.096.92% 
BombyxBGIBMGA000769-TA0.092.31% 
DrosophilaCG11537-PD0.070.96% 
EBI UniRef50UniRef50_Q5SR565e-15666.03%Hippocampus abundant transcript-like protein 1 n=87 Tax=Metazoa RepID=HIAL1_HUMAN
NCBI RefSeqXP_001687981.10.072.30%AGAP007253-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582859770.072.30%AGAP007253-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3227864740.072.13%hypothetical protein SINV_10524 [Solenopsis invicta]
Group
Gene OntologyGO:00550856.9e-33transmembrane transport
GO:00160216.9e-33integral to membrane
GO:00058868.1e-24plasma membrane
GO:00052158.1e-24transporter activity
KEGG pathway 
InterPro domain[1-418] IPR0161961.6e-52Major facilitator superfamily domain, general substrate transporter
[13-357] IPR0117016.9e-33Major facilitator superfamily
[46-65] IPR0019588.1e-24Tetracycline resistance protein, TetA/multidrug resistance protein MdtG
Orthology groupMCL11291 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212823-TA
ATGTCGGGCATCGGGGAACCCTCGGTGTTCCACGCTTTAGTGGTAATATTTTTGGAGTTCTTCGCGTGGGGATTACTCACGATGCCAATAATATCTGTCCTGAACGCCACGTTTCCAGATCATACGTTTTTGATGAACGGGCTTATTATGGGAATAAAAGGAATTTTGTCCTTTTTATCCGCTCCTCTTATCGGAGCATTATCTGACGTGTGGGGAAGAAAGTTTTTCCTTTTGGTAACAGTATTTTTTACCTGTGCACCTATACCACTGATGACCATTAACACATGGTGGTTCTTTGCGATGATAAGTATAAGTGGAGTGTTTGCTGTGACCTTCTCGATTGTATTCGCATACGTAGCCGATGTCACCACCGAGGCTGAGCGGTCTCGCGCCTATGGTCTCGTGTCCGCCACCTTCGCCGCCAGTATGGTCATATCACCTGCGTTAGGAGCTTACCTTATGGATTTGTATGGAGAGGCTCTCGTCGTTGCGGCTGCCACTGCAGTAGCAGTGCTGGACGTCTTCTTCATCATGGTTGCAGTTCCTGAAAGTCTTCCAGAGAAAGTACGACCCAGTGGCTGGGGGGCCAACATAAGCTGGGAACAGGCAGACCCATTTGCTGCCCTCAGGAAAGTTGGTGCAGAACGCACAGTGCTTATGTTGTGTGTTGCAGTGTTCCTATCCTATCTGCCAGAAGCCGGACAGTATTCATGCATATTTGTATATTTGAAGCTAGTGATGGGTTTTGGTGTAGTGCAGGTGGCGATCTTTATTGCAATAGTTGGTGTACTCAGTATTGCTGTGCAAGTGGTACTTGGATTTTTAATGAAATCTCTTGGTGCAAAGCACACAATTATGCTTGGACTTTTGTTTGAAATGATGCAACTTATGTGGTATGGATTTGGGAGTCGCACTTGGATGATGTGGGCCGCTGGAGTCCTCGCCGCTTTAGGGTCTCTCACATATCCAGCCATCAGTGCATATGTGTCTGTAAATAGCCGAGCAGACAGGCAGGGTGTAGTGCAAGGCATGGTAACTGGAGTTAGAGGGCTCTGCAATGGCCTTGGTCCTGCTATGTTTGGTGTTATATTCTACCTATTCCATGTTGATCTTAATGAAGAACATGCAGTACCAGGAATAAATACTCGACCAGATGATGAAAAGTATGTGCGATTGGTGCCCGGTCCACCATTTGTCTTGGGGGCTCTTCTCGTTATATGTGCACTGCTTGTGGCCGCTTTCCTGCCTGAAGATGGAACAGTGGGACCTCGAAGGTCTTCCCCCGATCTACGGTTCGAAGTGGAGCACGGCCGCCGCGTGGCAGGTCCCCTGTCCCCCTTGATGGCCCCTGAATCTGCAGCTCTCTAG

Protein sequence:

>DPOGS212823-PA
MSGIGEPSVFHALVVIFLEFFAWGLLTMPIISVLNATFPDHTFLMNGLIMGIKGILSFLSAPLIGALSDVWGRKFFLLVTVFFTCAPIPLMTINTWWFFAMISISGVFAVTFSIVFAYVADVTTEAERSRAYGLVSATFAASMVISPALGAYLMDLYGEALVVAAATAVAVLDVFFIMVAVPESLPEKVRPSGWGANISWEQADPFAALRKVGAERTVLMLCVAVFLSYLPEAGQYSCIFVYLKLVMGFGVVQVAIFIAIVGVLSIAVQVVLGFLMKSLGAKHTIMLGLLFEMMQLMWYGFGSRTWMMWAAGVLAALGSLTYPAISAYVSVNSRADRQGVVQGMVTGVRGLCNGLGPAMFGVIFYLFHVDLNEEHAVPGINTRPDDEKYVRLVPGPPFVLGALLVICALLVAAFLPEDGTVGPRRSSPDLRFEVEHGRRVAGPLSPLMAPESAAL-