Data science

[ follow ]
#data-integration
fromHackernoon
3 months ago
Data science

Kishore's Leadership in STIBO MDM & Strategic AI Implementation at a Major Healthcare Organization | HackerNoon

fromHackernoon
3 months ago
Data science

Kishore's Leadership in STIBO MDM & Strategic AI Implementation at a Major Healthcare Organization | HackerNoon

Data science
fromcointelegraph.com
3 days ago

How to use ChatGPT to find hidden gems in the crypto market

ChatGPT and AI tools can synthesize sentiment, onchain and technical data, and automated scanners to identify high-potential crypto tokens before mainstream attention.
fromPrivacy International
6 days ago

How Data Drives the Militarisation of Tech

There's a revolution occurring in how war and conflict are waged. New data-intensive systems are being developed; and commercial tech infrastructure is now supporting military operations. Data plays a key role in this revolution. Data is used to train and test systems, and the systems are fed data to target operations, communities, and individuals. While intelligence has long informed warfare, now we're seeing the very same dynamics that gave rise to surveillance capitalism feed a new era of innovation, feed a new era of innovation,
Data science
#chart-templates
#data-engineering
fromInfoQ
1 week ago
Data science

How Netflix Powers Audience Insights at Trillion-Row Scale

Netflix scaled Muse to trillions of rows, reducing query latency about 50% by using HyperLogLog sketches and redesigning the data serving layer.
fromMedium
1 month ago
Data science

Building Resilient Data Systems: Key Lessons from Veronika Durgin

Neglected data engineering tasks are crucial for stable and agile data pipelines.
fromABC7 San Francisco
1 week ago

SF engineer creates 'Find My Parking Cops' app; SFMTA disables it 4 hours later

"It's a rip off 'Find my Friends.' I was able to reverse engineer the SF parking ticket system so I could see close to real time where parking tickets were issued in the city. And I was making a map of where the actual parking cops were as they traverse the city and issue tickets. In theory, you could use that to avoid them and avoid a ticket," said Walz.
Data science
Data science
fromcointelegraph.com
1 week ago

How to use Grok 4 to research coins before you invest

Use Grok 4 to convert social hype into structured signals by scanning sentiment, summarizing fundamentals, and confirming onchain data before investing.
Data science
fromTechCrunch
1 week ago

Alloy is bringing data management to the robotics industry | TechCrunch

Alloy provides data infrastructure that encodes, labels, and enables natural-language search and rules-based observability to organize and detect issues in massive robot-generated datasets.
Data science
fromFlowingData
1 week ago

Trust and transparency in government data

Reliable statistical data enables evidence-based social programs and prevents policymakers from operating 'blind' or following biased directions.
Data science
fromLondon Business News | Londonlovesbusiness.com
1 week ago

Transforming raw data into business insights with a data analytics agency - London Business News | Londonlovesbusiness.com

Agencies and data lake consulting transform siloed, overwhelming raw data into actionable insights by integrating sources, applying advanced analytics, and building scalable infrastructure for decision-making.
Data science
fromSocial Media Explorer
1 week ago

The Social Power of Extracting Insights from Data Warehouse - Social Media Explorer

Centralizing healthcare data in a data warehouse reduces fragmentation and privacy risk while enabling trusted analytics that improve patient outcomes.
#ai
fromInfoWorld
2 months ago
Data science

Orchestrating AI-driven data pipelines with Azure ADF and Databricks: An architectural evolution

fromInfoWorld
2 months ago
Data science

Orchestrating AI-driven data pipelines with Azure ADF and Databricks: An architectural evolution

#python
#data-lineage
fromTechzine Global
1 week ago
Data science

Tracking data lineage from data archaeology to digital twins

Organizations must implement live, granular data lineage and metadata management to govern provenance, ensure compliance, trace transformations, and mitigate risks across data flows.
fromTechzine Global
2 months ago
Data science

Ataccama underlines AI data lineage for business users

Ataccama's platform allows business users to understand data lineage without SQL, improving data trust and decision-making.
Data science
fromInfoWorld
1 week ago

How AI changes the data analyst role

Analysts must adopt AI as a collaborator, deepen domain expertise, validate AI outputs, and become data storytellers while organizations provide evolving career paths and governance.
Data science
fromTechzine Global
1 week ago

How important is data analytics in cycling?

Data analytics acts as an essential, integrated teammate delivering marginal gains across rider performance, recruitment, logistics, and race strategy for Q36.5 Pro Cycling.
#microsoft-fabric
fromTheregister
1 week ago

UK Excel champ crowned

"It was a hard fought battle. To win by 11 points out of a maximum possible 3,750 is what some might call 'by the skin of my teeth'."
Data science
Data science
fromFlowingData
2 weeks ago

Sorting data, the quiz game

Dataguessr is a daily sorting game where players rank seven countries by dataset values, aiming to place as many correctly as possible.
Data science
fromFlowingData
2 weeks ago

Chartle, a daily guessing game with charts

Chartle is a daily Wordle-like game where players identify the country represented by a red line on a demographic time-series chart within five guesses.
Data science
fromComputerWeekly.com
2 weeks ago

Cloud file storage: Key benefits and use cases | Computer Weekly

Cloud-based file storage replaces local file servers/NAS, offering scalable, tiered, redundant storage suitable for general and specialist workloads like media and AI analytics.
Data science
fromFlowingData
2 weeks ago

Explaining the true size of Africa, a lesson in map projections

Africa's landmass is far larger than commonly portrayed, and Mercator projection significantly distorts relative sizes compared with equal-area projections.
Data science
fromInfoWorld
2 weeks ago

MongoDB adds vector search to self-managed editions to power generative AI apps

Specialty vector databases add user-friendly features while traditional providers add vector capabilities; companies prioritize flagship managed services and release vector search in public preview.
fromBattery Power
2 weeks ago

Who will be the 2026 Geraldo Perdomo/Maikel Garcia?

Right now, if you go to FanGraphs and sort by position player fWAR with 200 or more PAs, you get a list that maybe isn't that surprising. Or, rather, the placement of some guys might be surprising, but anyway... Aaron Judge, Bobby Witt Jr., Shohei Ohtani - those guys are phenomenal but they were better last year. Cal Raleigh is having a legendary season but has been an All-Star-plus quality guy for years now.
Data science
Data science
fromComputerworld
2 weeks ago

Solving world hunger with data

Curiosity and technical instincts can enable a transition from software development to data leadership, with risk-taking leading to long-term career payoff.
Data science
fromMedium
4 weeks ago

Orchestrating RAG pipelines with Apache Airflow

Apache Airflow provides flexible, reliable orchestration for production GenAI pipelines, enabling tool-agnostic, extensible, retry-capable workflows for embeddings, vector storage, and query pipelines.
Data science
fromTheregister
3 weeks ago

Neo4j intros 'property sharding' to tackle scalability

Infinigraph's property sharding enables horizontally scalable graph storage while preserving traversal performance and supporting both transactional and analytical workloads on a single system.
fromBusiness Matters
3 weeks ago

Why Every Trader Needs a Crypto Backtesting Tool Before Going Live

Trading can be exciting, but it is also unpredictable. Many traders lose money because they start trading live without testing their strategy. This is where backtesting comes in. It allows traders to test their strategies on historical trading data before risking real money. By understanding how a strategy would have worked in different market conditions, traders can make smarter decisions and reduce risks.
Data science
Data science
fromESPN.com
3 weeks ago

Matchup rankings: Drake Maye, Ricky Pearsall stand out in Week 2

Start the player with the superior matchup using schedule-independent Adjusted Fantasy Points Allowed to compare defenses after calibrating for strength of opponents.
Data science
fromMedium
3 weeks ago

Basics of Big Data and Streaming

Scala, Spark, Kafka, and Amazon EMR together enable scalable, high-performance batch and real-time big data processing pipelines.
Data science
fromABC7 Los Angeles
3 weeks ago

See how your cost of living has changed with the ABC Price Tracker

Interactive Price Tracker shows decade-long, region-specific prices for essentials across the 100 largest U.S. metro areas and updates automatically with the latest data.
#data-strategy
fromMedium
3 weeks ago

You might be a victim of corrupt personalization

Netflix emphasizes that the more you use the platform, the more personalized it will become. Source. Are you sure your feeds - Netflix, Amazon, whatever social media you prefer - is providing you with personalized content? (More about the difference between personalization and customization here.) Are you being given content that aligns with your actual interests, or is the algorithm steering you around?
Data science
Data science
fromRubyflow
3 weeks ago

Topical: Topic Modeling Pipeline for Ruby

A Ruby gem that provides a complete topic modeling pipeline using ClusterKit clustering and c-TF-IDF, combining Rust performance with Ruby usability.
Data science
fromDATAVERSITY
3 weeks ago

Women in Data: Meet Andrea Barber - DATAVERSITY

Andrea Barber builds accessible, beginner-focused Python and data analytics resources while advancing women’s empowerment and ethical, equitable use of healthcare data.
Data science
fromInfoWorld
3 weeks ago

Databricks adds Data Science Agent to automate analytics tasks

Databricks added the Data Science Agent to the Databricks Assistant to help data practitioners automate analytics tasks, including exploration, model training, and error diagnosis.
fromInfoQ
3 weeks ago

Google Spanner Unifies OLTP and OLAP with Columnar Engine

Google recently introduced a columnar engine for its globally distributed database, Spanner, intending to resolve the long-standing conflict between online transaction processing (OLTP) and analytical query processing (OLAP). The new feature, currently in preview, allows Spanner (Enterprise and Enterprise Plus editions) to handle both workloads simultaneously on a single database, eliminating the need for separate data warehouses and complex ETL (Extract, Transform, Load) pipelines.
Data science
fromInfoWorld
4 weeks ago

Databot: AI-assisted data analysis in R or Python

Can you create a histogram of game total scores to see the distribution of scoring? Could you make a box plot comparing home vs away team scores? Let's create a scatter plot of temperature vs total score to see if weather affects scoring. Can you show me the distribution of betting spreads and how they relate to actual game results? Could you create a visualization showing win/loss records by team?
Data science
Data science
fromWIRED
4 weeks ago

Is Congestion Pricing Working? The MTA's Revamped Data Team Is Figuring It Out

MTA's data team published real-time congestion-pricing and vehicle-entry data, centralizing transit datasets to increase transparency and enable public evaluation.
fromBarchart.com
4 weeks ago

Google Just Surged 9%! Here are 2 Options Trades to Keep Riding the Rally

Want to use this as your default charts setting? Save this setup as a Chart Templates Switch the Market flag for targeted data from your country of choice. Open the menu and switch the Market flag for targeted data from your country of choice. Need More Chart Options? Right-click on the chart to open the Interactive Chart menu. Use your up/down arrows to move through the symbols.
Data science
fromLondon Business News | Londonlovesbusiness.com
4 weeks ago

How data is changing the business of sports and fan engagement - London Business News | Londonlovesbusiness.com

Cheering at the stadiums and buying replica jerseys shifted to new ways to consume sports. Live matches on the Sportsbet betting platform, social media, fantasy leagues, highlights, and apps are capturing the attention of today's fans. Teams and brands understand that to keep fans engaged, they need to meet them wherever they are. This triggered an entirely new approach based on data about fans' behaviours, which proved to be just as valuable as the sports themselves.
Data science
fromFlowingData
1 month ago

What counts as rude behavior in public, by age group

Pew Research asked U.S. adults if certain behaviors in public, such as cursing or smoking, were acceptable. The above are the results for four age groups. For every behavior, the percentage of people who said it was rarely or never acceptable increased with age. Television and movies (and my own experiences) would tell you that sounds about right, but for some reason the clear trend surprised me. A quiz with the behaviors lets you get in on the action to see how crotchety you are.
Data science
Data science
fromElectronic Frontier Foundation
1 month ago

Open Austin: Reimagining Civic Engagement and Digital Equity in Texas

Open Austin trains Central Texans to build open-source civic technology, scaling a Data Research Hub answering residents' questions for community-driven solutions.
fromInfoWorld
1 month ago

From Teradata to lakehouse: Lessons from a real-world data platform modernization

Over the course of several years designing and delivering enterprise data platforms for a global pharmaceutical leader, I witnessed firsthand how data had evolved from a backend enabler to a frontline business asset. The organization was no longer just looking to report historical performance; it needed to predict outcomes, personalize patient engagement, customer engagement, brand performance and make regulatory decisions in near real time.
Data science
Data science
fromSimplilearn.com
2 years ago

Machine Learning Engineer vs. Data Scientist: How Do They Differ? | Simplilearn

Nearly every industry is being disrupted by Machine learning and data science.
They're so prevalent that many of us don't even realize how much they've changed our world.
Data science
fromFlowingData
1 month ago

Most American and British words

Spoken-word usage shows greater American–British divergence than written language, increasing as more commonly spoken words are emphasized.
Data science
fromInfoWorld
1 month ago

Using Cosmos DB in Microsoft Fabric

Cosmos DB integrates with Microsoft Fabric, enabling large-scale analytics of operational data for enterprise AI across diverse data types and familiar data science tools.
Data science
fromZDNET
1 month ago

Graph databases are exploding, thanks to the AI boom - here's why

Graph databases are the fastest-growing database category, driven by AI, with projected annual growth rates around 24–26%.
Data science
fromQuansight
1 month ago

Expressions are coming to pandas!

Pandas added a new, chainable column-assignment syntax to replace lambda-based patterns, improving predictability, introspection, and safety for dataframe operations.
Data science
fromTechzine Global
1 month ago

VMware launches Tanzu Data Intelligence for AI-driven apps

Tanzu Data Intelligence provides an on-premises enterprise lakehouse unifying structured and unstructured data to improve AI readiness and accelerate private-cloud AI agent development.
#data-lakehouse
fromDevOps.com
2 months ago
Data science

StarTree Bridges the Lakehouse Gap: Serving Apache Iceberg Data Directly to Applications - DevOps.com

fromDevOps.com
2 months ago
Data science

StarTree Bridges the Lakehouse Gap: Serving Apache Iceberg Data Directly to Applications - DevOps.com

Data science
fromLondon Business News | Londonlovesbusiness.com
1 month ago

Field service math for heavy equipment: How to prove ROI with the right metrics - London Business News | Londonlovesbusiness.com

Field service performance must be driven by validated metrics linking field execution to financial outcomes, focusing on first-time fixes, planned maintenance, and digital tooling.
#big-data
frommedium.com
1 month ago
Data science

Complete Guide to Learn Big Data

Learn big data end-to-end: fundamentals, programming, storage, batch/stream processing, ETL, cloud, ML, governance, and hands-on projects with runnable Airflow and PySpark Docker examples.
fromMedium
1 month ago
Data science

Why Your Big Data Architecture is Flawed

Data centrality and single-machine memory limits force adoption of new computational toolkits and scalable infrastructure to extract practical value from growing information streams.
Data science
fromBusiness Matters
1 month ago

Data-Driven Manager Decisions: From Time Reports to Team Growth

Analytics-driven time tracking transforms workforce management by providing real-time, AI-enhanced insights that optimize productivity, resource allocation, and organizational structure.
Data science
fromFast Company
1 month ago

What you can do about the government data that's disappearing

Federal government datasets are disappearing or being altered, undermining statistical trust and prompting archives and researchers to rescue and preserve public data.
Data science
fromTalkpython
1 month ago

Accelerating Python Data Science at NVIDIA

RAPIDS enables zero-code GPU acceleration for pandas, scikit-learn, NetworkX, and other Python data libraries, delivering large speedups and scalable GPU-native workflows.
Data science
fromHackernoon
3 months ago

How to Create a Foreign Data Wrapper in PostgreSQL and Aurora PostgreSQL on AWS RDS | HackerNoon

Foreign data wrappers enhance functionality in PostgreSQL and Aurora PostgreSQL by enabling external data integration.
fromHackernoon
5 months ago

Stationarity and Correlation Insights from VAR Modeling of Gas Base Fees | HackerNoon

The ADF test results confirm that both the gas base fee and blob gas base fee time series are stationary, with test statistics of -6.3719 and -10.5237.
Data science
Data science
fromDigiday
1 month ago

In AI and data, WPP Media revives a playbook it thinks it can finally win

WPP Media focuses on leveraging extensive data to differentiate itself in a competitive market.
fromTechzine Global
1 month ago

Snowflake launches Snowpark Connect to run Spark code natively

Snowpark Connect facilitates Apache Spark code execution directly within Snowflake warehouses, eliminating the need for separate Spark clusters and associated complexities like data movement.
Data science
fromTheregister
1 month ago

Snowflake builds Spark clients for its own analytics engine

Customers have been using Spark for a long time to process data and get it ready for use in analytics or in AI. The burden of running in separate systems with different compute engines creates complexity in governance and infrastructure.
Data science
Data science
fromHackernoon
2 years ago

How a Startup Using Gremlin Beat Everyone to Google's Door | HackerNoon

Google's acquisition of Wiz for $32 billion signifies a decisive victory in the cloud security sector.
Data science
fromIT Pro
1 month ago

Are geothermal data centers just hot air?

Geothermal energy is a reliable renewable source for powering large-scale data centers, particularly for high-density AI workloads.
Data science
fromTechzine Global
1 month ago

Scale Computing and Veeam now deliver full backup integration

Veeam's backup software integrates with Scale Computing's virtualization platform, enabling agentless hypervisor backup.
fromHackernoon
4 months ago

5 Major Business Mistakes When Working with Big Data: Lessons from a Company Managing 16 TB of Data | HackerNoon

Over a quarter of data and analytics professionals worldwide estimate that poor-quality data costs companies over $5 million annually, with 7% putting the figure at $25 million or more.
Data science
Data science
fromInfoWorld
1 month ago

Google updates agents in BigQuery to further automate analytics tasks

Google enhances BigQuery with a new code interpreter and advanced analytics features, improving automation in data engineering and data science tasks.
#data-centers
fromInfoWorld
1 month ago

Apache Flink integrates AI for real-time decision-making

With the 2.1 release, Apache Flink also now supports Process Table Functions (PTFs), the most powerful kind of function for Flink SQL and Table API.
Data science
Data science
fromMarTech
2 months ago

Messy data is your secret weapon - if you know how to use it | MarTech

Recent advances in AI enable effective analysis of messy, unstructured data, challenging the long-held belief that data must be clean.
fromInfoQ
2 months ago

Building Reproducible ML Systems with Apache Iceberg and SparkSQL: Open Source Foundations

Time travel in Apache Iceberg allows users to precisely identify which data snapshot produced exceptional results, eliminating the need to sift through production logs.
Data science
Data science
fromMedium
2 months ago

Scaling AI Responsibly: Lessons in Efficiency, Flexibility, and Platform Design

AI tooling development must prioritize speed and user-centric solutions to drive real-world impact.
Data science
fromTechzine Global
2 months ago

Comeback of LTO tape: market grew significantly in 2024

LTO tape market experienced significant growth in 2024, with 176.5 exabytes of compressed capacity introduced, marking a 15.4% increase from 2023.
Data science
fromNew Relic
2 months ago

Database Performance Monitoring - Now GA: Deep Query Analysis

Enhanced Database Performance Monitoring enables direct query-level insights, improving DBAs' ability to manage database performance.
fromHackernoon
6 months ago

Redefining Data Operations With Data Flow Programming in CocoIndex | HackerNoon

In traditional systems, side effects lead to increased complexity, debugging challenges, and unpredictable behavior. CocoIndex adopts a pure data flow programming approach, ensuring reliability.
Data science
Data science
fromHackernoon
2 months ago

Effective Data Chunking and Querying with Pinecone and GPT-4o | HackerNoon

Optimizing data ingestion in Pinecone involves preprocessing markdown and splitting articles into fixed-length chunks for improved relevance.
Data science
fromInfoWorld
1 year ago

Snowflake updates developer tools, adds observability features

Snowflake introduces Trail for enhanced observability in data management workflows.
[ Load more ]