  1. What are the pros and cons of the Apache Parquet format …

    Apr 24, 2016 · Some characteristics of Apache Parquet are: self-describing, columnar format, language-independent. In comparison to Apache Avro, Sequence Files, RC File etc. I want an …

  2. Extension of Apache parquet files, is it '.pqt' or '.parquet'?

    Oct 19, 2021 · I wonder if there is a consensus regarding the extension of parquet files. I have seen a shorter .pqt extension, which has the typical 3 letters (like in csv, tsv, txt, etc.) and then …

  3. How to append new data to an existing parquet file?

    Apr 27, 2023 · I have parquet files with some data in them. I want to add more data to them frequently every day. I want to do this without having to load the object into memory and then …

  4. How do I get schema / column names from parquet file?

    Nov 24, 2015 · Also, Cloudera (which supports and contributes heavily to Parquet) has a nice page with examples on usage of hangxie's parquet-tools. An example from that page for your …

  5. Python: save pandas data frame to parquet file - Stack Overflow

    Jan 29, 2019 · Is it possible to save a pandas data frame directly to a parquet file? If not, what would be the suggested process? The aim is to be able to send the parquet file to another …

  6. Updating values in apache parquet file - Stack Overflow

    Mar 3, 2015 · I have a quite hefty parquet file where I need to change values for one of the columns. One way to do this would be to update those values in source text files and recreate …

  7. How to read a Parquet file into Pandas DataFrame?

    How to read a modestly sized Parquet data-set into an in-memory Pandas DataFrame without setting up a cluster computing infrastructure such as Hadoop or Spark? This is only a …

  8. Is it better to have one large parquet file or lots of smaller parquet ...

    Mar 28, 2017 · The only downside of larger parquet files is that it takes more memory to create them. So you can watch out if you need to bump up Spark executors' memory. Row groups are a way …

  9. How to view Apache Parquet file in Windows? - Stack Overflow

    Jun 19, 2018 · What is Apache Parquet? Apache Parquet is a binary file format that stores data in a columnar fashion. Data inside a Parquet file is similar to an RDBMS style table where …

  10. Write null values to Parquet file with Parquet.Net creates an ...

    Apr 26, 2024 · I'm using Parquet.Net (4.23.5) to write a parquet file. I discovered that when I want to write the value null in a datacolumn the generated parquet file is unreadable.