Getting ISBNs and titles for JSTOR DDA

Stacy Pober stacy.pober at MANHATTAN.EDU
Sat Mar 29 23:37:48 EDT 2014


Heather and Paul asked about replacing the current separator between the
ISBN numbers with another element, such as a tab, that would be recognized
as a column separator.

Unfortunately, the report from Liblime Koha comes with multiple ISBN
numbers separated by SPACES.  Not tabs, not pipes, ,not commas - spaces.
 If there was a reasonable "special character" used to separate the
multiple ISBN numbers, I wouldn't have this problem.  <sigh>

 Many of the ISBN 020$a and 020$z fields legitimately have spaces, because
they list formats, prices and publisher names.

In most records, the extra information in the 020$a is separated by
punctuation, such as an open parenthesis character or a colon, but that's
not always the case.  Some of the extra information has alphabetic
characters, but some of them are numeric.

Here are some examples of what I'm getting in the 020$a field of the
report:
0675204092 0671204092 9780671204099
0387054995 (New York)
0030810213 Trade ed.
0300015186 (pbk.) 0300015178 : $7.50
0389011592 : 55/-
0444100474 (American Elsevier)
0395109485 (v. 1) 9780395109489 (v. 1) 0395204461 (v. 2) 9780395204467 (v.
2)
026264004X (v. 1) varies
0719004012 : 60/-
3540050892 DM38.00 ($10.50 U.S.)
0521072336(v.1)


Most of the extra data is clearly either a price, format, or publisher, but
I don't know what the "55/-" "60/-" are supposed to indicate.

This is why I haven't been able to figure a way to manpulate this with a
large-file text editor. I do have the full version of EmEditor for
situations like this, but I'm not much for writing regular expressions.

I suppose one way to think of this would be that I need a regex to insert a
tab before each continuous 10 or 13 character numeric string.

I found that pasting large amounts of data into an OpenOffice spreadsheet
takes an enormous amount of time, and ties up a good bit of my processing
power for way too long . If someone would suggest another open source
spreadsheet program that is better for this, I'd be glad to switch.

Of course, if I can't figure out how to get the ISBNs into multiple
columns, then the problem of pasting them back into the spreadsheet won't
really come up....

Thanks to all who offered help.  I will be contacting some folks privately.

Stacy



>
> On Thu, Mar 27, 2014 at 7:34 AM, Paul Poulain <paul.poulain at biblibre.com>
> wrote:
> Le 27/03/2014 12:22, Cockerill, Alan a écrit :
> >>             Can anyone suggest a way to get the Koha report of
> >>             concatenated ISBNs to translate into one ISBN per column?
> Hi Stacy,
>
> (I almost missed this thread, "Koha" is not written in the title...)
>
> If I understand correctly, you get concatenated ISBN like
> 3453459018|0980989823|04980989081 in biblioitems.isbn  field, right ?
>
> What about replacing | by something that can be interpreted as a <CR> ?
> You have the REPLACE mysql function :
>
> https://dev.mysql.com/doc/refman/5.0/en/string-functions.html#function_replace
>
> Something like
> SELECT REPLACE(biblioitems.isbn,'|',CONCAT(CHAR(10),CHAR(13))) FROM
> biblioitems.isbn
> could be useful ?
>
> Good luck
>
>
> --
> Paul POULAIN - BibLibre
> http://www.biblibre.com
> Free & Open Source Softwares for libraries
> Koha, Drupal, Piwik, Jasper
>

> Klish, Heather J Heather.Klish at tufts.edu via listserv.nd.edu
> Mar 26 (3 days ago)
>
> to WEB4LIB
> Stacy,
>
> Are the ISBNs in the column separated by a delimiter?   Have you tried
using Excel's Text to Columns wizard?
>
>
http://office.microsoft.com/en-us/excel-help/split-names-by-using-the-convert-text-to-columns-wizard-HA010102340.aspx
>
> Feel free to contact me off-list if you need additional help or if the
above option doesn't work for you.
>
> Heather
> · · · · · · · · · · · · · · ·
> Heather Klish
> Systems Librarian
> University Library Technology Services
> heather.klish at tufts.edu
> 617.627.5853
>
> sent from my phone




-- 
Stacy Pober
Information Alchemist
Manhattan College Library
Riverdale, NY 10471
stacy.pober at manhattan.edu

============================

To unsubscribe: http://bit.ly/web4lib

Web4Lib Web Site: http://web4lib.org/

2014-03-29
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.nd.edu/pipermail/web4lib/attachments/20140329/8674dee8/attachment.htm>


More information about the Web4lib mailing list