OJS OCS OMP OHS

You are viewing the PKP Support Forum | PKP Home Wiki



OJS search for Arabic content

OJS development discussion, enhancement requests, third-party patches and plug-ins.

Moderators: jmacgreg, btbell, michael, bdgregg, barbarah, asmecher

Forum rules
Developer Resources:

Documentation: The OJS Technical Reference and the OJS API Reference are both available from the OJS Documentation page.

Git: You can access our public Git Repository here. Comprehensive Git usage instructions are available on the wiki.

Bugzilla: You can access our Bugzilla report tracker here.

Search: You can use our Google Custom Search to search across our main website, the support forum, and Bugzilla.

Questions and discussion are welcome, but if you have a workflow or usability question you should probably post to the OJS Editorial Support and Discussion subforum; if you have a technical support question, try the OJS Technical Support subforum.

OJS search for Arabic content

Postby zain » Thu Feb 13, 2014 10:41 am

Hello
I have publish some Arabic content in my OJS .To see how the default Ojs search work with Arabic Content .OJS search function works fine with Arabic content and it give the result .But,I found the Arabic terms it look like this in OJS database:


586 | ??????
587 | ???????
588 | ?????
589 | ???
590 | ???
591 | ???
592 | ????
593 | ??????


.Can any one explain the reason and how to solve it .
thanks in advance,
Zain



Thanks in advance,
Zain
zain
 
Posts: 15
Joined: Mon Oct 21, 2013 3:19 pm

Re: OJS search for Arabic content

Postby asmecher » Thu Feb 13, 2014 10:46 am

Hi Zain,

Have you created your database to use UTF8? What character set configuration have you given OJS? (See your config.inc.php for the latter.)

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 7729
Joined: Wed Aug 10, 2005 12:56 pm

Re: OJS search for Arabic content

Postby zain » Thu Feb 13, 2014 5:01 pm

Hi
Thank you for your response .
In the config.inc.php the default locale in is en_USA and for the Database the default charset is utf8.
Do I need to create Arabic locale for OJS ?

The only think I need to investigate How the Default OJS search function works with Arabic Content ?

Best,
Zainab
zain
 
Posts: 15
Joined: Mon Oct 21, 2013 3:19 pm

Re: OJS search for Arabic content

Postby asmecher » Thu Feb 13, 2014 5:27 pm

Hi Zainab,

Can you double-check? At least "en_USA" is wrong. What are your exact settings for...

i18n.locale
i18n.client_charset
i18n.connection_charset
i18n.database_charset
i18n.charset_normalization

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 7729
Joined: Wed Aug 10, 2005 12:56 pm

Re: OJS search for Arabic content

Postby zain » Thu Feb 13, 2014 6:29 pm

Hi
this my i18n configration:

[i18n]

; Default locale
locale = en_US

; Client output/input character set
client_charset = utf-8

; Database connection character set
; Must be set to "Off" if not supported by the database server
; If enabled, must be the same character set as "client_charset"
; (although the actual name may differ slightly depending on the server)
connection_charset = utf8

; Database storage character set
; Must be set to "Off" if not supported by the database server
database_charset = utf8

; Enable character normalization to utf-8 (recommended)
; If disabled, strings will be passed through in their native encoding
; Note that client_charset and database collation must be set
; to "utf-8" for this to work, as characters are stored in utf-8
charset_normalization = Off

Best,
zainab
zain
 
Posts: 15
Joined: Mon Oct 21, 2013 3:19 pm

Re: OJS search for Arabic content

Postby asmecher » Thu Feb 13, 2014 8:03 pm

Hi Zainab,

Those settings are correct. Are you inspecting data in the database directly? If so, remember that the data may come from indexed galley files, which may not be UTF8-encoded.

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 7729
Joined: Wed Aug 10, 2005 12:56 pm

Re: OJS search for Arabic content

Postby zain » Fri Feb 14, 2014 9:00 am

Hi ,
sorry, I am novice at OJS .
How can we know the encoding of galley files?
Also ,
I have another question
when do we need to use rebuild indexing tool ?
Regards,
Zainab
zain
 
Posts: 15
Joined: Mon Oct 21, 2013 3:19 pm

Re: OJS search for Arabic content

Postby asmecher » Fri Feb 14, 2014 9:24 am

Hi Zainab,

The encoding of galley files will depend on how you create them. For example, HTML galleys can be created in ISO-8859-1 (also known as Latin1) or UTF-8 or dozens of other encodings, depending on the settings you use in your HTML creation program.

Likewise, if you're extracting text from PDF files, extraction tools can use various encodings. See the config.inc.php file for some example command lines that specify encoding. Depending on the way you write PDFs and the tools you use, the results of the text extraction may be messy. You can run these tools by hand to see what content OJS is getting to work with.

It should only be necessary to run the index rebuild tool if you change your encoding or text extraction settings.

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 7729
Joined: Wed Aug 10, 2005 12:56 pm

Re: OJS search for Arabic content

Postby zain » Sat Feb 15, 2014 11:06 am

Hi

Just to make sure so I don't need to create Arabic locale .
For publish and indexing Arabic journal .
Becuse evey things in my OJS setting is UTF8 encoding but, indexing is not in the correct format.
Thanks ,
Zainab
zain
 
Posts: 15
Joined: Mon Oct 21, 2013 3:19 pm

Re: OJS search for Arabic content

Postby asmecher » Sat Feb 15, 2014 3:30 pm

Hi Zainab,

What are you using to fetch the data from the database for inspection? Is it a command-line tool or something like phpMyAdmin? You need to be careful that whatever tool you're using can handle and display UTF8 as well.

Also, did you create your database with a different encoding than UTF8? You can check following the instructions here.

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 7729
Joined: Wed Aug 10, 2005 12:56 pm

Re: OJS search for Arabic content

Postby zain » Sun Feb 16, 2014 5:10 pm

Thanks so much .
The defualt database install was latin1.
I think I can solve the problem now.
Best,
zainab
zain
 
Posts: 15
Joined: Mon Oct 21, 2013 3:19 pm

Re: OJS search for Arabic content

Postby asmecher » Mon Feb 17, 2014 9:41 am

Hi Zainab,

Great, glad to hear it.

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 7729
Joined: Wed Aug 10, 2005 12:56 pm


Return to OJS Development

Who is online

Users browsing this forum: No registered users and 1 guest