TOPIC: UTF8 image filenames not encoded right after parse

UTF8 image filenames not encoded right after parse 11 years 6 months ago #21004

  • Yigal Eran
Hi,

I ran a test migration of a few WordPress posts into Joomla K2 items.
Everything worked perfectly.

Then I parsed the newly imported items, and all images were transferred to
the images/cmigration folder (not the media/images folder, as written in the User's Guide; see https://compojoom.com/support/documentation/cmigrator/ch03s04 ). But this is only a minor remark.

The main problem is that the original image filenames, which were encoded in UTF-8 (Hebrew), were recreated in some other charset (maybe ANSI) and are displayed as gibberish.

1. Here is the original file name:
סולידאריות.jpg

2. Here is the filename as it appears in right-click "Copy image URL":
%D7%A1%D7%95%D7%9C%D7%99%D7%93%D7%90%D7%A8%D7%99%D7%95%D7%AA.jpg

(the original full URL is:
kav.org.il/image/users/100994/ftp/my_fil...D7%AA.jpg?id=9270704 )

3. Here is the newly created filename after "Parse" stage:
׳¡׳•׳œ׳™׳“׳׳¨׳™׳•׳×.jpg

When I manually rename the image to the original filename, it is displayed correctly. That means that the src attribute of the img tag is written correctly.
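
To check the "some other charset" guess, here is a minimal Python sketch that works only from the strings in items 1 to 3. It decodes the percent-encoded name as UTF-8, then re-reads the same bytes as Windows-1255 (the Hebrew "ANSI" code page). That code page is an assumption, not something confirmed in this thread, but the result starts with the same characters as the filename in item 3.

```python
from urllib.parse import unquote

# Percent-encoded name as shown by "Copy image URL" (item 2).
encoded = "%D7%A1%D7%95%D7%9C%D7%99%D7%93%D7%90%D7%A8%D7%99%D7%95%D7%AA.jpg"

# Decoding the escapes as UTF-8 gives back the original Hebrew name (item 1).
original = unquote(encoded, encoding="utf-8")

# Assumption: the Windows host interpreted the UTF-8 bytes as Windows-1255.
# Bytes that Windows-1255 leaves undefined are replaced with U+FFFD here,
# so the result will not match item 3 character for character.
raw = original.encode("utf-8")
mojibake = raw.decode("cp1255", errors="replace")

print(original)
print(mojibake)
```

If the guess is right, the bytes on disk are still valid UTF-8 and only their interpretation as a filename is wrong, which would fit the observation that renaming the file is enough to fix the display.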

Thanks

UTF8 image filenames not encoded right after parse 11 years 6 months ago #21019

  • Daniel Dimitrov
The problem is that PHP cannot handle UTF-8 filenames.

stackoverflow.com/questions/6467501/php-...te-unicode-filenames - this reply says that this is the case on Windows, but not on Linux. So my advice would be to test the import on a server running Linux and see if you can get around the encoding problem.
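
If moving to Linux is not an option, one possible workaround is to store the files under ASCII-only names, so the filesystem encoding never comes into play. The sketch below is in Python rather than PHP and only illustrates the idea; `ascii_safe_name` and `original_name` are hypothetical helpers, not part of CMigrator.

```python
from urllib.parse import quote, unquote


def ascii_safe_name(filename: str) -> str:
    """Hypothetical helper: percent-encode the UTF-8 bytes of a filename."""
    # safe="" also escapes "/" so the result is always a single path component.
    return quote(filename, safe="")


def original_name(stored: str) -> str:
    """Hypothetical helper: recover the original name from the stored one."""
    return unquote(stored, encoding="utf-8")


name = "סולידאריות.jpg"
stored = ascii_safe_name(name)
print(stored)                 # ASCII only, safe on any filesystem
print(original_name(stored))  # back to the Hebrew name
```

The trade-off is that the name on disk now contains literal `%` characters, so any URL pointing at the file has to percent-encode it a second time.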

How many images do you have that are in UTF8?

UTF8 image filenames not encoded right after parse 11 years 6 months ago #21092

  • Yigal Eran
Hi,

Thanks for the accurate reply. You were right: the problem went away when I did the import on Linux hosting.

Thank you very much for the excellent extension and support :)

UTF8 image filenames not encoded right after parse 11 years 6 months ago #21094

  • Daniel Dimitrov
Thank you for getting back to me! Good to know that it works with Linux hosting :)

Don't forget to leave a review over here:
extensions.joomla.org/extensions/migrati...mport-a-export/19583

Thanks!
Daniel