BLU Discuss list archive


Asynchronous File I/O on Linux

Subject: Asynchronous File I/O on Linux
From: bogstad-e+AXbWqSrlAAvxtiuMwx3w at public.gmane.org (Bill Bogstad)
Date: Wed, 19 May 2010 13:09:44 -0400
In-reply-to: <68877CD0-25EA-4E37-B265-EDDFC6AC1BD7-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
References: <mailman.90988.1274159676.8097.discuss@blu.org> <4BF28EFD.8050304@mohawksoft.com> <8947B611-EA2B-4923-9921-85C6E3E3119A@gmail.com> <AANLkTilLEV3_vU_pmcB9FRWckJ7mWYS7w4vhgEkVJV5i@mail.gmail.com> <F56F6DB2-C40B-4D3C-8432-EA4B264D3FFA@gmail.com> <AANLkTinmlUayaxJCPYZ53LKFs_5f_nG3cow0kVGWpf7H@mail.gmail.com> <88F43E12-D6CF-4142-AE25-5436E2295325@gmail.com> <AANLkTilrnRQFH5mZtmMP7_UXekBRMG3UFs84CpXn70uM@mail.gmail.com> <68877CD0-25EA-4E37-B265-EDDFC6AC1BD7@gmail.com>

On Wed, May 19, 2010 at 10:32 AM, Richard Pieri <richard.pieri-Re5JQEeQqe8AvxtiuMwx3w at public.gmane.org> wrote:

>> Caching won't help me if I only want to look at each chunk once. If
>> the data was in the file sequentially then
>> the built-in kernel readahead would help. If the file format is fixed
>> and I want to process the data in some order other
>> than sequential, then the simplistic kernel readahead isn't going to
>> help (and may make things slower).
>
> Yeah... see... the problem now is the file storage format. What you really want now is an index into the actual data: find what you want from the index and use that pointer to jump immediately to the data you want instead of having to seek across Ghu knows how much file. As I said, this has been solved before.

Err, how do you "jump immediately to the data" without "having to
seek"? The only way I know to "jump immediately" via Linux/POSIX
APIs is explicitly with lseek() (or implicitly with pread()).

lseek() is cheap, since all the kernel has to do is change its
internal offset counter for the file descriptor associated with a disk
file. It's only when you do the subsequent read() that any real cost
is incurred. Assuming uncached disk files, that read is likely to
require disk head seeks, which is where the time cost comes in, and I
see no way around that.
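
To make the two ways of "jumping" concrete, here is a minimal sketch
using Python's os module, which wraps the same POSIX calls. The
16-byte record size, the temporary file, and the helper names are all
made up for illustration; nothing here reflects the actual file format
under discussion.

```python
import os
import tempfile

CHUNK = 16  # hypothetical fixed record size, for illustration only


def read_with_lseek(fd, offset, size):
    # Explicit jump: lseek() only updates the descriptor's offset;
    # the following read() is what actually touches the file data.
    os.lseek(fd, offset, os.SEEK_SET)
    return os.read(fd, size)


def read_with_pread(fd, offset, size):
    # Implicit jump: pread() takes the offset as an argument and
    # leaves the descriptor's own offset unchanged.
    return os.pread(fd, size, offset)


def demo():
    with tempfile.NamedTemporaryFile() as tmp:
        # Ten 16-byte records: a zero-padded record number plus newline.
        for i in range(10):
            tmp.write(b"%015d\n" % i)
        tmp.flush()

        fd = os.open(tmp.name, os.O_RDONLY)
        try:
            a = read_with_lseek(fd, 7 * CHUNK, CHUNK)  # record 7
            b = read_with_pread(fd, 3 * CHUNK, CHUNK)  # record 3
            pos = os.lseek(fd, 0, os.SEEK_CUR)         # current offset
        finally:
            os.close(fd)
    return a, b, pos


if __name__ == "__main__":
    print(demo())
```

Either way the offset has to come from somewhere (an index, say), and
either way the expensive part is the read of uncached data, not the
offset change itself.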

Bill Bogstad



BLU is a member of BostonUserGroups
We also thank MIT for the use of their facilities.

Boston Linux & Unix / webmaster@blu.org