BookShared
  • MEMBER AREA    
  • DuckDB in Action

    (By Mark Needham)

    Book Cover Watermark PDF Icon Read Ebook
    ×
    Size 27 MB (27,086 KB)
    Format PDF
    Downloaded 668 times
    Last checked 14 Hour ago!
    Author Mark Needham
    “Book Descriptions: Dive into DuckDB and start processing gigabytes of data with ease—all with no data warehouse.

    You don’t need expensive hardware or to spin up a whole new cluster whenever you want to analyze a big data set. You just need DuckDB! This modern and fast embedded database runs on a laptop, and lets you easily process data from almost any source, including JSON, CSV, Parquet, SQLite and Postgres. In DuckDB in Action you’ll learn everything you need to know to get the most out of this awesome tool, keep your data secure on prem, and save you hundreds on your cloud bill.

    Open up DuckDB in Action and learn how to:

    - Read and process data from CSV, JSON and Parquet sources both locally and remote
    - Write analytical SQL queries, including aggregations, common table expressions, window functions, special types of joins, and pivot tables
    - Use DuckDB from Python, both with SQL and its "Relational"-API, interacting with databases but also data frames
    - Prepare, ingest and query large datasets
    - Build cloud data pipelines

    Extend DuckDB with custom functionality

    DuckDB in Action introduces the DuckDB database and shows you how to use it to solve common data workflow problems. It’s full of quick wins—right from chapter one, you’ll be finding new ways that DuckDB can speed up your work as a data professional. Each new concept is paired with a hands-on project example, so you can easily see how DuckDB works in action.

    about the book

    DuckDB in Action will show you how to quickly get your hands dirty with DuckDB. You won’t need to read through pages of documentation—you’ll learn as you work. Begin with DuckDB’s CLI embedded mode, then dive straight into modern SQL queries and utilizing DuckDB’s handy SQL extensions. From there, you’ll explore the different ways you can analyze data with DuckDB, including advanced aggregation and analysis, data without persistence, and DuckDB’s underlying architecture. Learn how to combine DuckDB with the Python ecosystem for even greater customization, and how to extend DuckDB with its own tools. You’ll take to DuckDB like a duck to water, rapidly solving almost any relational data task with zero friction.”

    Google Drive Logo DRIVE
    Book 1

    Learn Rust in a Month of Lunches

    ★★★★★

    David MacLeod

    Book 1

    Accelerate: Building and Scaling High Performing Technology Organizations

    ★★★★★

    Nicole Forsgren

    Book 1

    The Black Swan: The Impact of the Highly Improbable

    ★★★★★

    Nassim Nicholas Taleb

    Book 1

    A City on Mars: Can We Settle Space, Should We Settle Space, and Have We Really Thought This Through?

    ★★★★★

    Kelly Weinersmith

    Book 1

    The Phoenix Project: A Novel About IT, DevOps, and Helping Your Business Win

    ★★★★★

    Gene Kim

    Book 1

    Antifragile: Things That Gain from Disorder

    ★★★★★

    Nassim Nicholas Taleb

    Book 1

    Data Mesh: Delivering Data-Driven Value at Scale

    ★★★★★

    Zhamak Dehghani

    Book 1

    On Tyranny: Twenty Lessons from the Twentieth Century

    ★★★★★

    Timothy Snyder

    Book 1

    Team Topologies: Organizing Business and Technology Teams for Fast Flow

    ★★★★★

    Matthew Skelton

    Book 1

    A Philosophy of Software Design

    ★★★★★

    John Ousterhout

    Book 1

    Hooked: How to Build Habit-Forming Products

    ★★★★★

    Nir Eyal

    Book 1

    Swann’s Way (In Search of Lost Time, #1)

    ★★★★★

    Marcel Proust

    Book 1

    Cat’s Cradle

    ★★★★★

    Kurt Vonnegut Jr.

    Book 1

    Database Internals: A deep-dive into how distributed data systems work

    ★★★★★

    Alex Petrov