Monarch geneset OGS2.0

DPOGS215420
TranscriptDPOGS215420-TA1764 bp
ProteinDPOGS215420-PA587 aa
Genomic positionDPSCF300088 + 795740-832415
RNAseq coverage8x (Rank: top 86%)
Annotation
HeliconiusHMEL0174467e-16591.36% 
BombyxBGIBMGA012380-TA3e-15585.38% 
DrosophilaCG9090-PA1e-10666.44% 
EBI UniRef50UniRef50_Q003254e-10263.55%Phosphate carrier protein, mitochondrial n=212 Tax=Eukaryota RepID=MPCP_HUMAN
NCBI RefSeqNP_001040419.11e-15184.72%mitochondrial inorganic phosphate carrier [Bombyx mori]
NCBI nr blastpgi|1140516342e-15084.72%mitochondrial inorganic phosphate carrier [Bombyx mori]
NCBI nr blastxgi|1140516344e-15084.72%mitochondrial inorganic phosphate carrier [Bombyx mori]
Group
KEGG pathway 
InterPro domain[2-278] IPR0233951.3e-56Mitochondrial carrier domain
[10-90] IPR0181081.5e-17Mitochondrial substrate/solute carrier
[417-461] IPR0041821.2e-06GRAM
Orthology groupMCL25959 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215420-TA
ATGTTCAGTAATAAATACTTCATGCTATGTGGTGCGGGAGGCGCCGTCGCCTGTGGTAGTACTCATACTTTATTAACCCCCTTGGACTTGGTTAAGTGCAGATTGCAAGTTGATCCAGGGAAGTACAAGTCCATATTCAAAGGTTTCGGAATATCCTTAAGTGAAGGTGGAATAGCAAACCTAGTAAAAGGATGGGCCCCAACGCTTATCGGTTATACCCTCCAAGGATCTGCAAAATTTGCTGGGTACGAATATTTCAAATATAAGTATGCAACTATGGTGGATGAAGAAAGTGCTTATTTATACAGAACATATTTATACTTGGCCGCGGCTGCATCGGCAGAATTAGTTGCAGATATTTTTTTAGCACCGTTTGAAGCTGTTAAAGTCAGAATGCAAACGACACCCGGTTATACGTCTCAGATGAGAAAAGCTATGCCACATATGATGTCTACAGAAGGGTTTGGTGTTTTTTATAAAGGTTTGACTCCTCTTTGGGGAAGACAAGTTCCTTATACAATGATGAAATTCGCATCTTTCGAAAAGACACTAGAGTTTTTATATGAAAATGTAGTTCCAAAACCAAGAGAACAGTGTACTAAAGTTGAGCAGTTAATAGTCACCTTTGCTGCTGGATATATAGCCGGTGTTCTTTGCGCCATCGTATCACATCCTTCAGATACAATTGTCTCCAAGTTGAATAAAGACCCGAGTTCTTCTATCGGAGGCATAATATCAGAAGTTGGAATGATTGGTATATGGCGCGGGCTAGTTGCTCGTATCATAATGATTGGGACTCTCACAGGTCTTCAATGGTTTATCTATGATGCCTTCAAAGTTTATACTCAAATGCCACGCCCTCCACCGGCTGAAATGCCAGAATCTCTGAAAAAGAAATTAGCTGGAAATGTAGCTGAATCTGATAGTGCAGATGCGGAGTCGGGATTCGAGGAGGCTGATCCCGGAGAGGCGGGGAACAATGTTATTAAGATGGTATCGAGATTCGTTGACAAGGTGTGTACGGAAGGCGGCGTGACGGCGGAGCACGTGCGCTGTCTTCATCAGATGATACCCGGCGTCGTTCACATGCACCTGGAGACGTTGGACGGAGTGGCCAGGGAGAGTCGGCGGCTTCCACCCGTACAGAAGCCTCGGATAGCGTGGCCGACCTTAGTGCACGGCGAGGCCAGTGCGGGTGGCGCCCTACGCGCGGTGCTGCTCGCTGATGGCCGGAGTGCTGCACTGCCACAGCTGCTGCCAGCCGAGGGAGCGCTGTTCCTCACCAACTACAGGCTGCTGTTCAAAGGAGTTCCTGTCGACCCTTACGCATGCGAGGCGACGGTGGTCCGCTCGTTCCCCCTGAGCGCCCTCACCCGCGAGAAGGGTGTCCGCGCCGCGCCCGCCCACCTGGAACACGCGCCTCATGACGCCTTGCAGCTCAGGGCTGCCACCTTCCAGCTTATTAAGGTGGCGCTGGACGAGGAGGTGAGCAGCGAGCAGGCGGAGTCGTTCCGTAAGGCGGTGTCTCGTCTGCGGCACCCTCCCCACCCCCTGCTGCACTTCGCCCTCGCGCCTCGAGCTGCACCCCCCAGCGACCTCGCTCAGCCTAAACACCACACACTCAAAGGATTCGCCAAAAAGACCCTCCTGAAGACGGCTCGTCGTGCCGGGTTGAAGCCCAAGCCATCCAAGCGACAGAAGTACGTGCTGCCGGCGGACGCTCGAGAAACGAAAAAAGTAAAGACAAAAGACAGGCGTCAATAG

Protein sequence:

>DPOGS215420-PA
MFSNKYFMLCGAGGAVACGSTHTLLTPLDLVKCRLQVDPGKYKSIFKGFGISLSEGGIANLVKGWAPTLIGYTLQGSAKFAGYEYFKYKYATMVDEESAYLYRTYLYLAAAASAELVADIFLAPFEAVKVRMQTTPGYTSQMRKAMPHMMSTEGFGVFYKGLTPLWGRQVPYTMMKFASFEKTLEFLYENVVPKPREQCTKVEQLIVTFAAGYIAGVLCAIVSHPSDTIVSKLNKDPSSSIGGIISEVGMIGIWRGLVARIIMIGTLTGLQWFIYDAFKVYTQMPRPPPAEMPESLKKKLAGNVAESDSADAESGFEEADPGEAGNNVIKMVSRFVDKVCTEGGVTAEHVRCLHQMIPGVVHMHLETLDGVARESRRLPPVQKPRIAWPTLVHGEASAGGALRAVLLADGRSAALPQLLPAEGALFLTNYRLLFKGVPVDPYACEATVVRSFPLSALTREKGVRAAPAHLEHAPHDALQLRAATFQLIKVALDEEVSSEQAESFRKAVSRLRHPPHPLLHFALAPRAAPPSDLAQPKHHTLKGFAKKTLLKTARRAGLKPKPSKRQKYVLPADARETKKVKTKDRRQ-