Monarch geneset OGS2.0

DPOGS209820
Transcript: DPOGS209820-TA; 1122 bp
Protein: DPOGS209820-PA; 373 aa
Genomic position: DPSCF300117 + 408133-412264
RNAseq coverage: 130x (Rank: top 56%)
Annotation
Heliconius: HMEL008994; 0.0; 78.23%
Bombyx: BGIBMGA008030-TA; 9e-129; 57.53%
Drosophila: (no hit listed)
EBI UniRef50: UniRef50_B0X1J6; 8e-116; 53.70%; Allantoicase n=4 Tax=Culicidae RepID=B0X1J6_CULQU
NCBI RefSeq: XP_001657333.1; 4e-117; 56.64%; allantoicase [Aedes aegypti]
NCBI nr blastp: gi|157326991; 1e-116; 56.91%; allantoicase [Aedes aegypti]
NCBI nr blastx: gi|157326991; 2e-114; 56.25%; allantoicase [Aedes aegypti]
Group
Gene Ontology: GO:0004037; 4.5e-174; allantoicase activity
KEGG pathway: aag:AaeL_AAEL014045; 1e-116
    K01477 (E3.5.3.4, ALLC, alc) maps-> Purine metabolism
InterPro domain: [2-371] IPR005164; 4.5e-174; Allantoicase
    [8-200] IPR008979; 8.4e-54; Galactose-binding domain-like
    [23-195] IPR015908; 3.3e-53; Allantoicase domain
Orthology group: MCL14656; Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209820-TA
ATGATTGAAAAAATGGAAGAAATACCCGCCTTTGCCGCTCTAAGTGAATTTGCTAGCAGTAGCGCCGGCGGCCAAGTTCTATTCGCAACCGACGATTTCTTCGCACCATGTGAAAACATGATATTAGATACCGAGCCAGTTTTCATAGCTGATAAATACACTGAGTACGGAAAATGGATGGATGGCTGGGAGACTCAAAGGAAGAGAATACCTGGTCATGATTGGTGCATCATAAAACTGGCTACAAAATGTGTTATAAGAGGTTTATTAATAGATACAGCTTTTTTCTCGGGGAACTACGCACCGAAATACTCTATACAGGCCGCTTGTTTAACACCAGAAGAAGAGGCGTTATTGCCCGAAAGGGACTCAGAAATGGGGTCAGCGTGCACCGAATGTGACTTGGAACGGGTAAAACAGCTTAGGACTGACAAATGGGAGGAAATTGTCCCTATAACTGCTTTGCGACCCGGCTATGAAGAGACCAGGATGAACTTCCAGAAGGTACTATGTGATGAAGCGTGGACGCACATACGCGTGAATATTTACCCGGATGGCGGCATTGCAAGACTTCGCGTATACGGTGAAGCAAAGCCGGAGCTGCCAGCCAGTGATCACTTAATCGATTTAATATCATTATTAAATGGTGGAACTTGCTTGGAATATTCAAATGCACATTACGGTCACCCGAGAAACGTGATAAAGCCTTGCAAAAGCCAGGCAATGTCTGACGGTTGGGAAACAGCCAGGAGATTAGACAGACCTGAAGTACTCGAGACGTATGATGACGGAACTTTGAAAGTATCGGGAGAAGAATGGTCGATTTTCAAATTGGGTTTTTGTGGAAGAATTACAAACATCTGTGTTGATACCGCACATTTCAAAGGCAATTATCCGGATACGATTAAAATAGAAGGAGCATTTGTTACAAGTGAATGGGCACAATCAAATAACATTACTTGGTTCAATATACTGAAACGTAGCAAGCTATCGCCTCATAAGGAGCATTGGTTCACGTGTAAATCAGATGTTGTGTCTCATATTCGTGTTACAATAGGTCCGGACGGAGGACTTAGCCGACTTAGAACATTTGGTTACGTGCAACCAGCAATTATTATTTGA

Protein sequence:

>DPOGS209820-PA
MIEKMEEIPAFAALSEFASSSAGGQVLFATDDFFAPCENMILDTEPVFIADKYTEYGKWMDGWETQRKRIPGHDWCIIKLATKCVIRGLLIDTAFFSGNYAPKYSIQAACLTPEEEALLPERDSEMGSACTECDLERVKQLRTDKWEEIVPITALRPGYEETRMNFQKVLCDEAWTHIRVNIYPDGGIARLRVYGEAKPELPASDHLIDLISLLNGGTCLEYSNAHYGHPRNVIKPCKSQAMSDGWETARRLDRPEVLETYDDGTLKVSGEEWSIFKLGFCGRITNICVDTAHFKGNYPDTIKIEGAFVTSEWAQSNNITWFNILKRSKLSPHKEHWFTCKSDVVSHIRVTIGPDGGLSRLRTFGYVQPAIII-
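
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration. It translates the first and last codons of DPOGS209820-TA, writes the stop codon as "-" to match the record's convention, and confirms that 1122 bp corresponds to 373 residues plus one stop codon.

```python
from itertools import product

# Standard genetic code (assumed here), built from the conventional
# T, C, A, G ordering of the first, second and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, one residue per codon.

    Stop codons are written as `stop_symbol`; the record above uses "-".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First nine codons of DPOGS209820-TA.
print(translate("ATGATTGAAAAAATGGAAGAAATACCC"))  # MIEKMEEIP

# Last seven codons of DPOGS209820-TA, including the TGA stop codon.
print(translate("CAACCAGCAATTATTATTTGA"))  # QPAIII-

# Length check: 373 residues plus one stop codon, three bases each.
print(3 * (373 + 1))  # 1122
```

Both translated fragments agree with the start (MIEKMEEIP) and end (QPAIII-) of DPOGS209820-PA. Passing the full nucleotide sequence to `translate` in the same way would check the whole record.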