Re: Parquet support
From | Brian M Hamlin |
---|---|
Subject | Re: Parquet support |
Date | |
Msg-id | d99556ac-35df-18b2-960b-2555377ae090@light42.com |
In reply to | Re: Parquet support (Daniele Varrazzo <daniele.varrazzo@gmail.com>) |
List | psycopg |
Hi - desktop Linux user/maker here in California --

The engineering stakes are high in the clouds these days. There are some important efforts underway to build "cloud-native" approaches to Python, Python installation, Python data and Python communication tools. In my corners of the world (remote sensing, urban planning) that means Dask and xarray. As a desktop Linux distribution, we/OSGeoLive ship both, and enthusiastically so. The "cloud-native" data storage formats Zarr and Parquet, not so much.

My best understanding is that xarray is a happy medium between "what only runs on cloud" and "the powerful Linux I can run myself on standard equipment today". I support a Python ecosystem that individual people can run entirely locally, and that can interoperate well with standard networking and data formats. Not every Python environment is doing that.. change happens.

Interested to see the common and useful Python discussion here, regarding PostgreSQL, PostGIS and cloudy interoperability.

--Brian M Hamlin / MAPLABS / OSGeoLive PSC

On 11/23/22 12:00, Daniele Varrazzo wrote:
> On Wed, 23 Nov 2022 at 20:56, Vladimir Ryabtsev <greatvovan@gmail.com> wrote:
>> Just curious folks, what are your thoughts about the scope of that potential support? What is the use case? Is it loading data from Parquet to Postgres (and back)? Why is the combination with Python modules like pyarrow not enough?
>
> I am not an expert, but I understand that Python-Postgres roundtrip
> goes via generating and parsing CSV files, whereas there is some
> performance gain to be had by creating native arrow data.
>
> -- Daniele