← Back to team overview

desktop-packages team mailing list archive

[Bug 694188] Re: [Upstream] Office does not try to detect character set of CSV

 

Launchpad has imported 4 comments from the remote bug at
https://bugs.freedesktop.org/show_bug.cgi?id=35019.

If you reply to an imported comment from within Launchpad, your comment
will be sent to the remote bug automatically. Read more about
Launchpad's inter-bugtracker facilities at
https://help.launchpad.net/InterBugTracking.

------------------------------------------------------------------------
On 2011-03-04T08:11:09+00:00 Björn Michaelsen wrote:

It would be great to have charset autodetection as an option for the csv
import in Calc. Currently the last charset gets remembered.

I imagine that this could cause quite a bit of trouble to endusers who
accidentally change their charset import setting to something that is
not the default, but looks like an innocent choice (e.g. "Unicode").

see also:

 https://bugs.launchpad.net/ubuntu/+source/openoffice.org/+bug/694188

Reply at: https://bugs.launchpad.net/df-
libreoffice/+bug/694188/comments/2

------------------------------------------------------------------------
On 2011-03-04T09:04:09+00:00 Björn Michaelsen wrote:

There is already a basic implementation of charset detection implemented
in the writer text import as SwIoSystem::IsDetectableText:

http://opengrok.libreoffice.org/xref/writer/sw/source/filter/basflt/iodetect.cxx#427

It old and ugly, but could be a starting point. Obviously, it would have
to be moved out of writer and polished a bit so that it can be used in
other applications too.

Reply at: https://bugs.launchpad.net/df-
libreoffice/+bug/694188/comments/3

------------------------------------------------------------------------
On 2011-06-24T04:59:53+00:00 Alexander-balzer wrote:

Would be nice if the implementation would also work with 38637 - Better
handling for csv-Files

Reply at: https://bugs.launchpad.net/df-
libreoffice/+bug/694188/comments/8

------------------------------------------------------------------------
On 2011-12-23T11:50:46+00:00 Björn Michaelsen wrote:

[This is an automated message.]
This bug was filed before the changes to Bugzilla on 2011-10-16. Thus it
started right out as NEW without ever being explicitly confirmed. The bug is
changed to state NEEDINFO for this reason. To move this bug from NEEDINFO back
to NEW please check if the bug still persists with the 3.5.0 beta1 or beta2 prereleases.
Details on how to test the 3.5.0 beta1 can be found at:
http://wiki.documentfoundation.org/QA/BugHunting_Session_3.5.0.-1

more detail on this bulk operation: http://nabble.documentfoundation.org
/RFC-Operation-Spamzilla-tp3607474p3607474.html

Reply at: https://bugs.launchpad.net/df-
libreoffice/+bug/694188/comments/14

-- 
You received this bug notification because you are a member of Desktop
Packages, which is subscribed to libreoffice in Ubuntu.
https://bugs.launchpad.net/bugs/694188

Title:
  [Upstream] Office does not try to detect character set of CSV

Status in LibreOffice Productivity Suite:
  Confirmed
Status in The OpenOffice.org Suite:
  Invalid
Status in “libreoffice” package in Ubuntu:
  Won't Fix
Status in “openoffice.org” package in Ubuntu:
  Won't Fix

Bug description:
  Binary package hint: openoffice.org

  When importing a CSV file in OOo Calc, the Lucid version of OOo does
  not recognize the correct character set. The CSV import module just
  seems to default to the last character set used.

  When doing this with the OOo from 8.04LTS, or with the official
  version from Openoffice.org, autodetection does it's job.

  How to reproduce:
  1) make one or two CSV files, for example:
  echo '"text","test","vest","vast"' > /tmp/1.csv
  echo '"text","test","vest","vast"' > /tmp/2.csv
  2) open it with OpenOffice.org:
  openoffice.org /tmp/1.csv
  Choose "unicode" for import. (This is wrong, but please do this anyway).
  The result shows various non-Latin characters, which is expected as we chose the wrong import character set.
  Now close the file and reopen any other file (or, for that matter, the same file):
  openoffice.org /tmp/2.csv
  Now Unicode is the default import type. There simply seems to be no autodetection.

  ProblemType: Bug
  DistroRelease: Ubuntu 10.04
  Package: openoffice.org-calc 1:3.2.0-7ubuntu4.1
  ProcVersionSignature: Ubuntu 2.6.32-26.48-generic 2.6.32.24+drm33.11
  Uname: Linux 2.6.32-26-generic i686
  Architecture: i386
  Date: Fri Dec 24 15:39:04 2010
  InstallationMedia: Ubuntu 10.04 "Lucid Lynx" - Release Candidate i386 (20100419.1)
  ProcEnviron:
   LANGUAGE=en_US:en_CA:en
   LANG=en_US.utf8
   SHELL=/bin/bash
  SourcePackage: openoffice.org

To manage notifications about this bug go to:
https://bugs.launchpad.net/df-libreoffice/+bug/694188/+subscriptions