Monarch geneset OGS2.0

DPOGS200492
TranscriptDPOGS200492-TA1806 bp
ProteinDPOGS200492-PA601 aa
Genomic positionDPSCF300158 + 35234-47953
RNAseq coverage901x (Rank: top 14%)
Annotation
HeliconiusHMEL0145756e-13441.06% 
BombyxBGIBMGA005094-TA3e-11839.02% 
DrosophilaCG4822-PB9e-12041.46% 
EBI UniRef50UniRef50_D2A1P63e-16449.07%Putative uncharacterized protein GLEAN_08427 n=3 Tax=Endopterygota RepID=D2A1P6_TRICA
NCBI RefSeqXP_001811847.17e-16549.07%PREDICTED: similar to CG4822 CG4822-PA [Tribolium castaneum]
NCBI nr blastpgi|2700062571e-16349.07%hypothetical protein TcasGA2_TC008427 [Tribolium castaneum]
NCBI nr blastxgi|1936246162e-16049.58%PREDICTED: ATP-binding cassette sub-family G member 1-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00055248.1e-230ATP binding
GO:00160218.1e-230integral to membrane
GO:00160202.1e-24membrane
GO:00168872.4e-19ATPase activity
GO:00001669.3e-18nucleotide binding
GO:00171119.3e-18nucleoside-triphosphatase activity
KEGG pathwayrno:852641e-117 
 K05679 (ABCG1)maps-> ABC transporters
InterPro domain[14-594] IPR0200648.1e-230ABC transporter, G1-like
[313-523] IPR0135252.1e-24ABC-2 type transporter
[42-164] IPR0034392.4e-19ABC transporter-like
[27-213] IPR0035939.3e-18ATPase, AAA+ type, core
Orthology groupMCL32210 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200492-TA
ATGGGTCGTAAATGTAACATCCTGCAACTTGAAACAAGTGAGAGGACAATTCTTCACAACGTATCAGGTGAGTTTCGGTCTGGGGAGCTGACTTGCATACTTGGACCCTCAGGAGCTGGAAAATCCACTTTGCTGAACATCCTCGCTGGGTATACATTGTCCGGTGTCAATGGTAGAATAACCGTGAACGGTCAAGCTAGGGATATGAGGGTGTTTAAAAAACTCAGCAGCTACATCATGCAGGATGACATATTGCAGCCAAGGCTGACAGTCAACGAATCATTGAAGATAGCTGCGGAGCTTAAACTGGGTTCAGAACTAGGGAAAGCTGAAAAGGCACTCGTGGTGGAGGAAATTCTACAAACTTTGGGACTATGGGATCATAGAGACACCATGTCACAAAGTCTCTCTGGAGGACAATCGAAACGTCTGTCAATTGCCTTGGAACTGGTCAACAATCCACCGATTATATTCCTCGATGAACCCACAACCGGTCTAGATATTGTATCTGTTCGTCAGTTGGTAGTCCTGCTTCGTCTCCTCTCACGTCAAGGCCGAACTATCATTTGTACGATCCACCAACCATCAGCATCGCTGTTTTCACTATTCGACCGAGTCTACGTTTTATCCCGCGGACTGTGCTGTTATCAGGGCGCAGCGCCTCTTCTTGTGCCATATCTGGCTGAAGTCGGTCATGTTTGTCCAAAAACGCATAACCCAGCAGATTTTGTGCTGGAAACTCTAGTTGGTAATGTGGATACGGCAGCACAAATGTCGGAATTATGCCAAAATGGGAAGCTCTGTAGGAATGTAAATAAAATGACAAGGGATGGGAGAAAACCGGAGTTACGTTCGTATGAATCAATTCAAAGAATATTTACAGAGCACGTGGCGAAAGAACAACTGCAGAAAATGAATTTTCCAACATCATTCACGACACAGTTTCTGATTTTAGTTAAGAGAATGTTTTTACAGATGAGCAGGAATTCGCTTAGTCTATGGATACAACTGCTCCATCATGTGATTGGAGCTCTGCTAACGGGCGGGATATTCTTTCTCATCGGAAACGATGGCAACCAGCCCATCGCCAATTTCAAATTCTGTATCAGCTGTGTTGTATTCTTCATGTACACGTACCTTATGATTCCTATATTGTTGTTTCCAACGGAATTAAGGATGCTTCGTCGAGAGTACTTTAACTGTTGGTACAGCTTGAAAGCCTATTACGCAGCTTTGACACTATCTACAATACCATTGCTAATAATCCTGGGTACTTTATTTATAGTCATCTGTTACACCATGTCAGGGCAAATTTTTGAGTTCGAGCGCTTCGTGCTATTCACAATCACGGGGCTTTTAACTGGAATATGTTCTGAAGGCTTCGGCTTACTGATCGGATCCACATTTAACGTAACCAACGGTTCCATCGTTGGACCAGCGACTGTTTCTCCACTTCTAGCCCTATGTTGTTACGGCCTGGGTTTTGGATCTCATATAGAAACCTTCATGAAATTTCTAATGTCGTTATCTTATTTACGTTATTCATTAAATGGATTCTGTTTAGCTTTGTATCAGATGCGTCCTGCTTTGAATTGTGATACTGATTTTTGTTTGTACGCTGATTCCAAAATACTTTTAAGGGATTTGGGGATGATTGATCACACTTACAGCACACAGATGATTTGTTTACTAGTATTTACGATCGTTCACAGATTTTTTGCATATTTCGCATTAAAATACAGTCTTAGAGAGGAGTTTACAAGCAAATTCATGACTTATGTTAGTAAATTTTTAAAACATAGATAA

Protein sequence:

>DPOGS200492-PA
MGRKCNILQLETSERTILHNVSGEFRSGELTCILGPSGAGKSTLLNILAGYTLSGVNGRITVNGQARDMRVFKKLSSYIMQDDILQPRLTVNESLKIAAELKLGSELGKAEKALVVEEILQTLGLWDHRDTMSQSLSGGQSKRLSIALELVNNPPIIFLDEPTTGLDIVSVRQLVVLLRLLSRQGRTIICTIHQPSASLFSLFDRVYVLSRGLCCYQGAAPLLVPYLAEVGHVCPKTHNPADFVLETLVGNVDTAAQMSELCQNGKLCRNVNKMTRDGRKPELRSYESIQRIFTEHVAKEQLQKMNFPTSFTTQFLILVKRMFLQMSRNSLSLWIQLLHHVIGALLTGGIFFLIGNDGNQPIANFKFCISCVVFFMYTYLMIPILLFPTELRMLRREYFNCWYSLKAYYAALTLSTIPLLIILGTLFIVICYTMSGQIFEFERFVLFTITGLLTGICSEGFGLLIGSTFNVTNGSIVGPATVSPLLALCCYGLGFGSHIETFMKFLMSLSYLRYSLNGFCLALYQMRPALNCDTDFCLYADSKILLRDLGMIDHTYSTQMICLLVFTIVHRFFAYFALKYSLREEFTSKFMTYVSKFLKHR-