{"id":836,"date":"2010-01-05T12:14:22","date_gmt":"2010-01-05T20:14:22","guid":{"rendered":"http:\/\/boost-spirit.com\/home\/?p=836"},"modified":"2010-01-22T08:15:04","modified_gmt":"2010-01-22T16:15:04","slug":"stream-based-parsing-made-easy","status":"publish","type":"post","link":"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/","title":{"rendered":"Stream-based Parsing Made Easy"},"content":{"rendered":"<p>We got many questions how to parse data which has to be accessed using a standard input stream (usually a <span style=\"font-family: Courier New;\">std::istream<\/span>) without having to read all of the data into memory first.<\/p>\n<p>The standard way of iterating over a stream would be to wrap it into a <span style=\"font-family: Courier New;\">std::istream_iterator<\/span>, but unfortunately this is not possible. <em>Qi<\/em> requires iterators to be at least <a href=\"http:\/\/www.sgi.com\/tech\/stl\/ForwardIterator.html\">forward iterators<\/a>, while the <span style=\"font-family: Courier New;\">std::istream_iterator<\/span> is an <a href=\"http:\/\/www.sgi.com\/tech\/stl\/InputIterator.html\">input iterator<\/a> only. To overcome this limitation Spirit V2.2 (the next version of Spirit, to be released with Boost V1.42) will implement a new iterator.<\/p>\n<p><!--more--><\/p>\n<pre class=\"brush: cpp; title: ; notranslate\" title=\"\">\r\nnamespace boost { namespace spirit\r\n{\r\n\u00a0\u00a0\u00a0 \/\/ this is functionally equivalent to std::istream_iterator\r\n\u00a0\u00a0\u00a0 template &lt;typename Char, typename Traits = std::char_traits&lt;Char&gt; &gt;\r\n\u00a0\u00a0\u00a0 struct basic_istream_iterator;\r\n\r\n\u00a0\u00a0\u00a0 \/\/ predefine specialization for 'char'\r\n\u00a0\u00a0\u00a0 typedef basic_istream_iterator&lt;char&gt; istream_iterator;\r\n}}\r\n<\/pre>\n<p>This iterator is functionally equivalent to the <span style=\"font-family: Courier New;\">std::istream_iterator<\/span> except that it is a <a href=\"http:\/\/www.sgi.com\/tech\/stl\/ForwardIterator.html\">forward iterator<\/a> you can utilize for your Qi parsing needs. Here is an example:<\/p>\n<pre class=\"brush: cpp; title: ; notranslate\" title=\"\">\r\n#include &lt;boost\/spirit\/include\/support_istream_iterator.hpp&gt;\r\n\r\nnamespace spirit = boost::spirit;\r\n\r\n\/\/ open file, disable skipping of whitespace\r\nstd::ifstream in(&quot;some_data_file&quot;);\r\nin.unsetf(std::ios::skipws);\r\n\r\n\/\/ wrap istream into iterator\r\nspirit::istream_iterator begin(in);\r\nspirit::istream_iterator end;\r\n\r\n\/\/ use iterator to parse file data\r\nspirit::qi::parse(begin, end, &lt;...your grammar here...&gt;);\r\n<\/pre>\n<p>This iterator is implemented on top of the <span style=\"font-family: Courier New;\">multi_pass<\/span> iterator framework, which is documented <a href=\"http:\/\/www.boost.org\/doc\/libs\/1_41_0\/libs\/spirit\/doc\/html\/spirit\/support\/multi_pass.html\">here<\/a>. For those curious enough to have a peek: you can access the new iterator from <a href=\"www.boost.org\">Boost<\/a> SVN <a href=\"http:\/\/svn.boost.org\/svn\/boost\/trunk\/boost\/spirit\/home\/support\/iterators\/istream_iterator.hpp\">here<\/a>. But in order to use it with Boost prior to V1.42 you will need to download the <a href=\"http:\/\/svn.boost.org\/svn\/boost\/trunk\/boost\/spirit\/home\/support\/iterators\/\">whole iterators directory<\/a> from SVN.<\/p>\n<div class=\"sharedaddy sd-sharing-enabled\"><div class=\"robots-nocontent sd-block sd-social sd-social-icon-text sd-sharing\"><h3 class=\"sd-title\">Share this:<\/h3><div class=\"sd-content\"><ul><li><a href=\"#\" class=\"sharing-anchor sd-button share-more\"><span>Share<\/span><\/a><\/li><li class=\"share-end\"><\/li><\/ul><div class=\"sharing-hidden\"><div class=\"inner\" style=\"display: none;\"><ul><li class=\"share-facebook\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-facebook-836\" class=\"share-facebook sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=facebook\" target=\"_blank\" title=\"Click to share on Facebook\" ><span>Facebook<\/span><\/a><\/li><li class=\"share-twitter\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-twitter-836\" class=\"share-twitter sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=twitter\" target=\"_blank\" title=\"Click to share on Twitter\" ><span>Twitter<\/span><\/a><\/li><li class=\"share-end\"><\/li><li class=\"share-pinterest\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-pinterest-836\" class=\"share-pinterest sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=pinterest\" target=\"_blank\" title=\"Click to share on Pinterest\" ><span>Pinterest<\/span><\/a><\/li><li class=\"share-linkedin\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-linkedin-836\" class=\"share-linkedin sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=linkedin\" target=\"_blank\" title=\"Click to share on LinkedIn\" ><span>LinkedIn<\/span><\/a><\/li><li class=\"share-end\"><\/li><li class=\"share-reddit\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-reddit sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=reddit\" target=\"_blank\" title=\"Click to share on Reddit\" ><span>Reddit<\/span><\/a><\/li><li class=\"share-tumblr\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-tumblr sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=tumblr\" target=\"_blank\" title=\"Click to share on Tumblr\" ><span>Tumblr<\/span><\/a><\/li><li class=\"share-end\"><\/li><li class=\"share-end\"><\/li><\/ul><\/div><\/div><\/div><\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>We got many questions how to parse data which has to be accessed using a standard input stream (usually a std::istream) without having to read all of the data into memory first. The standard way of iterating over a stream would be to wrap it into a std::istream_iterator, but unfortunately this is not possible. Qi [&hellip;]<\/p>\n<div class=\"sharedaddy sd-sharing-enabled\"><div class=\"robots-nocontent sd-block sd-social sd-social-icon-text sd-sharing\"><h3 class=\"sd-title\">Share this:<\/h3><div class=\"sd-content\"><ul><li><a href=\"#\" class=\"sharing-anchor sd-button share-more\"><span>Share<\/span><\/a><\/li><li class=\"share-end\"><\/li><\/ul><div class=\"sharing-hidden\"><div class=\"inner\" style=\"display: none;\"><ul><li class=\"share-facebook\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-facebook-836\" class=\"share-facebook sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=facebook\" target=\"_blank\" title=\"Click to share on Facebook\" ><span>Facebook<\/span><\/a><\/li><li class=\"share-twitter\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-twitter-836\" class=\"share-twitter sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=twitter\" target=\"_blank\" title=\"Click to share on Twitter\" ><span>Twitter<\/span><\/a><\/li><li class=\"share-end\"><\/li><li class=\"share-pinterest\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-pinterest-836\" class=\"share-pinterest sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=pinterest\" target=\"_blank\" title=\"Click to share on Pinterest\" ><span>Pinterest<\/span><\/a><\/li><li class=\"share-linkedin\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-linkedin-836\" class=\"share-linkedin sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=linkedin\" target=\"_blank\" title=\"Click to share on LinkedIn\" ><span>LinkedIn<\/span><\/a><\/li><li class=\"share-end\"><\/li><li class=\"share-reddit\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-reddit sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=reddit\" target=\"_blank\" title=\"Click to share on Reddit\" ><span>Reddit<\/span><\/a><\/li><li class=\"share-tumblr\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-tumblr sd-button share-icon\" href=\"http:\/\/boost-spirit.com\/home\/2010\/01\/05\/stream-based-parsing-made-easy\/?share=tumblr\" target=\"_blank\" title=\"Click to share on Tumblr\" ><span>Tumblr<\/span><\/a><\/li><li class=\"share-end\"><\/li><li class=\"share-end\"><\/li><\/ul><\/div><\/div><\/div><\/div><\/div>","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_s2mail":"","spay_email":"","jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true},"categories":[9],"tags":[8],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/pIHdZ-du","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"http:\/\/boost-spirit.com\/home\/wp-json\/wp\/v2\/posts\/836"}],"collection":[{"href":"http:\/\/boost-spirit.com\/home\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/boost-spirit.com\/home\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/boost-spirit.com\/home\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"http:\/\/boost-spirit.com\/home\/wp-json\/wp\/v2\/comments?post=836"}],"version-history":[{"count":11,"href":"http:\/\/boost-spirit.com\/home\/wp-json\/wp\/v2\/posts\/836\/revisions"}],"predecessor-version":[{"id":925,"href":"http:\/\/boost-spirit.com\/home\/wp-json\/wp\/v2\/posts\/836\/revisions\/925"}],"wp:attachment":[{"href":"http:\/\/boost-spirit.com\/home\/wp-json\/wp\/v2\/media?parent=836"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/boost-spirit.com\/home\/wp-json\/wp\/v2\/categories?post=836"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/boost-spirit.com\/home\/wp-json\/wp\/v2\/tags?post=836"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}