15

I've been informed that my library is slower than it should be, on the order of 30+ times too slow parsing a particular file (a text file, 326 KB). The user suggested that it may be because I'm using std::ifstream (presumably instead of FILE).

I'd rather not blindly rewrite, so I thought I'd check here first, since my guess would be the bottleneck is elsewhere. I'm reading character by character, so the only functions I'm using are get(), peek(), and tellg()/seekg().

Update:

I profiled, and got confusing output: gprof didn't appear to show the reading taking that long. I rewrote the program to read the entire file into a buffer first, and it sped up by about 100x. I think the problem may have been the tellg()/seekg() calls, which took a long time but which gprof may have been unable to see for some reason. In any case, ifstream does not appear to buffer the entire file, even at this size.
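For reference, the buffered version amounts to roughly this (a simplified sketch, not my exact code):

    #include <fstream>
    #include <sstream>
    #include <string>

    // Slurp the whole file into a string, then parse from memory
    // instead of calling get()/peek()/seekg() on the stream.
    std::string slurp(const char* path)
    {
        std::ifstream in(path, std::ios::in | std::ios::binary);
        std::ostringstream buf;
        buf << in.rdbuf();   // one bulk copy of the stream contents
        return buf.str();
    }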


6 Answers

29

I don't think that would make a difference. Especially if you're reading char by char, the overhead of I/O is likely to completely dominate everything else. Why do you read single bytes at a time? Do you know how extremely inefficient that is?

On a 326 KB file, the fastest solution will most likely be to just read it into memory at once.

The difference between std::ifstream and the C equivalents is basically a virtual function call or two. It may make a difference if executed a few tens of millions of times per second; otherwise, not really. File I/O is generally so slow that the API used to access it doesn't really matter. What matters far more is the read/write pattern: lots of seeks are bad, sequential reads/writes are good.
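For example, something like this reads the whole file with a single read() call (a sketch only; error handling is omitted and the helper name is made up):

    #include <fstream>
    #include <vector>

    // Pull the whole file in with one read() instead of ~326k get() calls.
    std::vector<char> read_all(const char* path)
    {
        std::ifstream in(path, std::ios::binary);
        in.seekg(0, std::ios::end);
        std::streamsize size = in.tellg();
        in.seekg(0, std::ios::beg);

        std::vector<char> data(static_cast<std::size_t>(size));
        in.read(data.data(), size);
        return data;
    }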

6
  • 1
    Actually, I didn't know how inefficient it is. I just assumed that behind the scenes, it read it into memory. I guess I'll do that instead. Commented Jan 25, 2009 at 6:38
  • 4
    Some input streams are buffered. If your code reads one char at a time, that doesn't mean the underlying stream does so too.
    – jfs
    Commented Jan 25, 2009 at 6:49
  • 5
    Both FILE and fstream are buffered (although the buffer may be too small). Linux heavily optimizes disk access, so your file, which is relatively small, will be loaded into memory (Windows also does this).
    – Ismael
    Commented Jan 25, 2009 at 6:51
  • 2
    Depends on how much it buffers and such. I'm willing to bet that reading the entire file in one go will still be faster. Commented Jan 25, 2009 at 7:37
  • @jalf: Easy statement to make. It may be faster, but I am willing to bet not significantly. Commented Jan 25, 2009 at 7:39
4

It should be slightly slower, but as you said, it might not be the bottleneck. Why don't you profile your program and see whether that's the case?

3

All benchmarks are evil. Just profile your code for the data you expect.

I performed an I/O performance comparison between Ruby, Python, Perl, and C++ once. For my data, language versions, etc., the C++ variant was several times slower (it was a big surprise at the time).

4
  • What about C? I'd be very surprised if any of the mentioned languages besides C++ was operating faster than C.
    – rr-
    Commented Mar 7, 2015 at 18:56
  • @rr- if your disk can't provide data faster than 100M/s then it doesn't matter that your C program may process 1G/s. As other answers have said already, disk i/o is typically much slower than anything else in your program. See related questions on I/O performance: Why is reading lines from stdin much slower in C++ than Python? and Reading in an entire file at once in C++, part 2
    – jfs
    Commented Mar 7, 2015 at 19:33
  • In your post you said C++ was behaving the worst among Ruby, Python and others. This should have nothing to do with I/O performance, which is a bottleneck for all of these languages uniformly. And that's where I'd be surprised to find out that C is behaving worse than Ruby, for example, mainly because Ruby is written in C.
    – rr-
    Commented Mar 7, 2015 at 19:43
  • @rr-: 1. If I/O dominates, the language doesn't matter unless you're Google. 2. If the file is cached, then read the links I've provided to see how programs written in the same language may show different performance results for the same problem
    – jfs
    Commented Mar 7, 2015 at 19:50
3

I agree that you should profile. But if you're reading the file a character at a time, how about creating a memory-mapped file? That way you can treat the file like an array of characters, and the OS should take care of all the low-level buffering for you. The simplest and probably fastest solution is a win in my book. :)
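On a POSIX system the memory-mapped approach looks roughly like this (a sketch with error handling omitted and an invented file name; on Windows you would use CreateFileMapping/MapViewOfFile instead):

    #include <fcntl.h>
    #include <sys/mman.h>
    #include <sys/stat.h>
    #include <unistd.h>

    int main()
    {
        // Map the file read-only and walk it like an in-memory char array.
        int fd = open("input.txt", O_RDONLY);
        struct stat sb;
        fstat(fd, &sb);

        const char* data = static_cast<const char*>(
            mmap(nullptr, sb.st_size, PROT_READ, MAP_PRIVATE, fd, 0));

        // ... parse data[0] .. data[sb.st_size - 1] directly; no get()/seekg() ...

        munmap(const_cast<char*>(data), sb.st_size);
        close(fd);
    }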

3

I think it is unlikely that your problem will be fixed by switching from fstream to FILE*; usually both are buffered by the C library. Also, the OS can cache reads (Linux is very good in that respect). Given the size of the file you are accessing, it is pretty likely it will end up entirely in RAM.

As PolyThinker says, your best bet is to run your program through a profiler and determine where the problem is.

Also, you are using seekg/tellg; this can cause notable delays if your disk is heavily fragmented, because the first time the file is read the disk has to move its heads to the correct position.

2

Here is an excellent benchmark which shows that under extreme conditions, fstreams are actually quite slow... unless:

  1. You use buffering (I cannot stress that enough; see the sketch after this list)
  2. You manipulate the buffer yourself (that is, if you need performance, as the OP in the linked question does), which is not so different from using FILE*.
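For point 1, installing a bigger buffer looks roughly like this (just a sketch: the 1 MiB size and file name are made up, and pubsetbuf generally has to be called before open() to take effect):

    #include <fstream>
    #include <vector>

    int main()
    {
        // Hand the stream a larger buffer before opening the file;
        // calling pubsetbuf after open() is implementation-defined.
        std::vector<char> buf(1 << 20);   // 1 MiB, an arbitrary size for the sketch
        std::ifstream in;
        in.rdbuf()->pubsetbuf(buf.data(), buf.size());
        in.open("input.txt", std::ios::binary);

        // ... read as usual; each get() now hits the OS far less often ...
    }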

You shouldn't optimize prematurely, though. fstreams are generally better, and if you need to optimize them down the road, you can always do it later at little cost. To prepare for the worst in advance, I suggest creating a minimal proxy for fstream now so that you can optimize it later without needing to touch anything else.
