Monarch geneset OGS2.0

DPOGS211475
TranscriptDPOGS211475-TA993 bp
ProteinDPOGS211475-PA330 aa
Genomic positionDPSCF300113 - 260321-263372
RNAseq coverage5373x (Rank: top 2%)
Annotation
HeliconiusHMEL0046162e-9277.83% 
BombyxBGIBMGA007988-TA3e-13477.08% 
DrosophilaNurf-38-PA6e-10554.13% 
EBI UniRef50UniRef50_F5HK076e-10363.32%AGAP003398-PD n=19 Tax=Coelomata RepID=F5HK07_ANOGA
NCBI RefSeqXP_967051.12e-10865.36%PREDICTED: similar to AGAP003398-PA [Tribolium castaneum]
NCBI nr blastpgi|3125975982e-13177.08%inorganic pyrophosphatase [Bombyx mori]
NCBI nr blastxgi|3125975982e-13077.08%inorganic pyrophosphatase [Bombyx mori]
Group
Gene OntologyGO:00002871.6e-144magnesium ion binding
GO:00044271.6e-144inorganic diphosphatase activity
GO:00067961.6e-144phosphate metabolic process
GO:00057371.6e-144cytoplasm
KEGG pathwaytca:6554135e-108 
 K11726 (NURF38)maps-> Oxidative phosphorylation
InterPro domain[14-330] IPR0081621.6e-144Inorganic pyrophosphatase
Orthology groupMCL11596 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211475-TA
ATGTCAGTAACACGCGTTTTAGCGGCGAGTCCCCGGTGTTTGTCATTTGTTAAAACGCTTTGTGCGTCTGTCGGCGTAGTTCACGTTAGGAAAGTGACGACGGTTACTGAAACGATCCGCTCCAAGATGTTCATCGCTGAAGAACGTGGCTCCCCTTTCGCGCCCGACTACAGGGTCTATTTTAAGGACGAGAGCGGTCCCGTCTCTCCGCTCCACGACATCCCGCTGTGGGCGGACCGCGGCCGGCGCGAGGCGCACATGGTGGTGGAGGTGCCGCGCTGGAGCAACGCCAAGATGGAGATCAGTCTCGGGGAACCTCTCAACCCAATCAAGCAGGACGTGAAGAAAGGCGCGCTCAGGTTCGTGGCCAACGTGTTCCCCCACCACGGGTACATCTGGAACTACGGAGCGCTGCCTCAGACCTGGGAGAACCCGCAGCACGTGGACCCCGCCACGCAAGCCCGCGGCGACAACGACCCCATAGACGTCATAGAGATCGGCGAGAGGGTCGCGGCCAGAGGTGACGTCATCACCGTCAAGATACTCGGAACACTCGCTCTCATCGACGAAGGAGAGACAGACTGGAAGTTGATAGCCATCGACGTGAAGGACCCCGCGGCCGCCAGGATGAACGACGTGGCCGACGTGGAGACCGTGTTCCCGGGGCTACTGAGGGCCACCGTGGAGTGGTTCAGATTGTACAAGGTGCCGGACGGGAAGCCCGTCAACCAGTTCGCCTTCGACGGAGAGGCCAAAGACGCCGCCTTCGCGCACCGCGTCGTGGACGAGGTGCACGAGTTTTGGAAGGCGCTGGTGGCGGGGGAGGCCGCGGACGCCAGCGACATCTGTAAGATGAACGTGTCGGTCGCGAACAGCGCGGCGCGCGTGGAGCGCGGGGAGGCGGCGCGAGTGTTGGCCGGAGCTCCGCCGGCAGCTCTACCTCAGGACATACCAGCCAGCGTTGACAAGTGGCACTACTTGTCGAGTCTCTAA

Protein sequence:

>DPOGS211475-PA
MSVTRVLAASPRCLSFVKTLCASVGVVHVRKVTTVTETIRSKMFIAEERGSPFAPDYRVYFKDESGPVSPLHDIPLWADRGRREAHMVVEVPRWSNAKMEISLGEPLNPIKQDVKKGALRFVANVFPHHGYIWNYGALPQTWENPQHVDPATQARGDNDPIDVIEIGERVAARGDVITVKILGTLALIDEGETDWKLIAIDVKDPAAARMNDVADVETVFPGLLRATVEWFRLYKVPDGKPVNQFAFDGEAKDAAFAHRVVDEVHEFWKALVAGEAADASDICKMNVSVANSAARVERGEAARVLAGAPPAALPQDIPASVDKWHYLSSL-