Monarch geneset OGS2.0

DPOGS208075
TranscriptDPOGS208075-TA2148 bp
ProteinDPOGS208075-PA715 aa
Genomic positionDPSCF300282 + 21073-39057
RNAseq coverage505x (Rank: top 25%)
Annotation
HeliconiusHMEL0033420.071.09% 
BombyxBGIBMGA007784-TA0.074.60% 
DrosophilaCG7627-PB2e-13937.70% 
EBI UniRef50UniRef50_D2K6M70.065.09%ATP-binding cassette sub-family C member 4 n=2 Tax=Trichoplusia ni RepID=D2K6M7_TRINI
NCBI RefSeqXP_969849.15e-16543.51%PREDICTED: similar to ATP-binding cassette transporter [Tribolium castaneum]
NCBI nr blastpgi|2702097610.065.09%ATP-binding cassette sub-family C member 4 [Trichoplusia ni]
NCBI nr blastxgi|2702097630.065.09%ATP-binding cassette sub-family C member 4 [Trichoplusia ni]
Group
Gene OntologyGO:00068101.6e-29transport
GO:00550851.6e-29transmembrane transport
GO:00055241.6e-29ATP binding
GO:00426261.6e-29ATPase activity, coupled to transmembrane movement of substances
GO:00160211.6e-29integral to membrane
GO:00168874.7e-15ATPase activity
GO:00001666.6e-12nucleotide binding
GO:00171116.6e-12nucleoside-triphosphatase activity
KEGG pathwaydpo:Dpse_GA182609e-138 
 K05673 (ABCC4)maps-> ABC transporters
InterPro domain[65-400] IPR0115271.6e-29ABC transporter, transmembrane domain, type 1
[96-360] IPR0011407.1e-26ABC transporter, transmembrane domain
[472-583] IPR0034394.7e-15ABC transporter-like
[457-631] IPR0035936.6e-12ATPase, AAA+ type, core
Orthology groupMCL22168 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208075-TA
ATGGAATCGAAAGTTGTGTTGAAGGAGAAAAATCCACACGATAAAGCCAATTTTCTCTCAAAAATATTTTTATGGTGGAGCTTCAAGCTATTCAAAAGAGGATACAAAACAGGATTGACAATAGAGGATCTATGGCAGGCCCGCAGTGTGGACCAATCGGGTCCTCTCGGGGACAGGCTTGAAGAGGCTTGGGAAAGGGAAGTTGAAAGATCTCGTAAAAATAATACCAAACCATCACTCTCCAGAGCCATTGTCAGCAGTTTCTGGTTAGAGTATATGACAAGCGGGATTTTAGTTGGTGTTTTGTTCATTGTCGTATGGCCTCTGATCCCTTACACTCTAGCTTTATTCATAAAATATTTTTCATCTGAGAAAACGCCAGAATCGTATAGAAATGCACACATACATAACTTTTTATTGAATTTTTTTTCCATCTTATCGGCCCTTCTAATGAATCATACTAATTTATCTCAGGCTTGTGTTGGTATGAGGGTGAGGATAGCCTCATGTTCTCTCGTCTATAGGAAGATATTGAAACTGAACCGTGTTGGCATAAGTAAAACTGACTCCGGCCAAGTCATCAACTTGATGTCAAACGATGTGAACAGATTCGACATAGCGGCACCATTATTGAGTATTTTGTGGGTGATGCCGATCGTGGTACCAGTCGTTTGTTACTTGGTCTGGCAACACATTGGTTACGCGACATTGGCCGCTCTCGCCGTTATTGTTATACAAACAGTCACAGTACAAGTCTATTTGAGTAATCGTCAAGGTATATTGAGGGGGAAAATCGCGAGACGTACGGATGAAAGGGTGAAAGTTATGAGCGAATTAGTCAATGGAGTTCAGGTCATCAAAATGTATGCTTGGGAGAAGCCTTTCGAAAAGCTCGTGGACAAACTAAGAAAAATTGAAATAAAGTTTATATTACGAACATCGTTTATAAAAGGTTTCTCTACCGCTTTGAGCGTGTTCACGGAACGTTTCATACTATACGCAGCTGTCGTCACCTTCGTGGTTACCGGAGGGGAAATCAGCTCGGACATCACATTCTCCCTCGTTCAATATTACAATCTGATGCAACTAGCCTGTAATATATTATTCCCAATGGCTCTTGCGTTCCTCGCGGAATCTAGAGTGTCCATACGTCGATTGGAGGAATTTCTAGCACTTGATGAATCGGAAGGTGATGATCCAAAAATAGCTAGTTTCAACGCTAATTCGTTGCTTAACGGAATGGACAGCGAAAAGGAGAATTCAAATAACAGAGTGAAGCCGACGGGGCTGGTTATATCTAATGCTACAGCCAGTTGGCAGCCCAACCCTATAGTGCACACATTGAGGAACATAACCCTTAATTTTCAACCTGGAGAATTTATAGGAGTCGCTGGTCTTGTAGGATCTGGAAAGTCGTCCTTCCTCCAACTAATCCTTGGTGAGCTGCGACCTTCTAAGGGTACGGTGTCCCTGGGAGGTTCGAGGGTATCGTACGCCAGCCAAGAGCCTTGGCTCTTCGTGGCCACAATCAAACAGAACATACTCTTCGGTCTTCCATACGATCGCTTGAAGTATAAAAAGGTGGTGACGGCGTGTGCATTGTTACGGGATTTCGAACAGCTACCGGCTGGCGACTCCACAATGGTCGGTGAAAGAGGCATCAGCCTCAGTGGTGGTCAGCGAGCGAGAATCGGTCTAGCACGTGCCTGTTATAGAAATGCTGATATTTATCTGTTGGACGATCCGCTATCGGCCGTCGACACTCACGTCGGTAAACACATAGTGTCTGAATGTGTGATGGGACTATTGAGACACTCTACGAGGATTCTGGTCACACATCAGCTGCACCATCTCAAGCAAGCTGACAGAGTAGTCATACTACACAATGGTGAGGTGGAAACGTGTGGTACATTCGAGGAGGTATCGAAGTGTCCATTATTCAAGGAGCTTGAGCATGAAGAGCAATCTCCAGATGACTCCGCCAATCCACAGATACTCAGGAAAAGGACCCTCTCCGTTCAGTCTCGCTTGAGCGCCAGTACTACAGCGGAGTCACCTCTAGATGAAGAGGTCGAGCCAAGCGAGAGCGACGAGCTGATGGAAAAAGGTAGGGAAATAACCCGTCTTCACTCACAATTGAACACGTGA

Protein sequence:

>DPOGS208075-PA
MESKVVLKEKNPHDKANFLSKIFLWWSFKLFKRGYKTGLTIEDLWQARSVDQSGPLGDRLEEAWEREVERSRKNNTKPSLSRAIVSSFWLEYMTSGILVGVLFIVVWPLIPYTLALFIKYFSSEKTPESYRNAHIHNFLLNFFSILSALLMNHTNLSQACVGMRVRIASCSLVYRKILKLNRVGISKTDSGQVINLMSNDVNRFDIAAPLLSILWVMPIVVPVVCYLVWQHIGYATLAALAVIVIQTVTVQVYLSNRQGILRGKIARRTDERVKVMSELVNGVQVIKMYAWEKPFEKLVDKLRKIEIKFILRTSFIKGFSTALSVFTERFILYAAVVTFVVTGGEISSDITFSLVQYYNLMQLACNILFPMALAFLAESRVSIRRLEEFLALDESEGDDPKIASFNANSLLNGMDSEKENSNNRVKPTGLVISNATASWQPNPIVHTLRNITLNFQPGEFIGVAGLVGSGKSSFLQLILGELRPSKGTVSLGGSRVSYASQEPWLFVATIKQNILFGLPYDRLKYKKVVTACALLRDFEQLPAGDSTMVGERGISLSGGQRARIGLARACYRNADIYLLDDPLSAVDTHVGKHIVSECVMGLLRHSTRILVTHQLHHLKQADRVVILHNGEVETCGTFEEVSKCPLFKELEHEEQSPDDSANPQILRKRTLSVQSRLSASTTAESPLDEEVEPSESDELMEKGREITRLHSQLNT-