{"id":2261,"date":"2018-03-10T03:38:27","date_gmt":"2018-03-10T03:38:27","guid":{"rendered":"http:\/\/wp.sigmod.org\/?p=2261"},"modified":"2019-09-23T12:34:16","modified_gmt":"2019-09-23T12:34:16","slug":"a-tribute-to-jose-alfredo-blakeley","status":"publish","type":"post","link":"https:\/\/wp.sigmod.org\/?p=2261","title":{"rendered":"A Tribute to Jos\u00e9 Alfredo Blakeley"},"content":{"rendered":"<div align=\"justify\">\n Jos\u00e9 Alfredo Blakeley, Partner Architect at Microsoft, passed away on January 7th, 2018. With this &#8220;tribute&#8221;, we would like to honor his many contributions to data management. Jos\u00e9 will be sorely missed as a great scientist, mentor, colleague, and friend.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-2266\" src=\"http:\/\/wp.sigmod.org\/wp-content\/uploads\/2018\/03\/JoseBlakeley.jpg\" alt=\"\" width=\"177\" hspace=\"20\" height=\"266\" srcset=\"https:\/\/wp.sigmod.org\/wp-content\/uploads\/2018\/03\/JoseBlakeley.jpg 533w, https:\/\/wp.sigmod.org\/wp-content\/uploads\/2018\/03\/JoseBlakeley-100x150.jpg 100w, https:\/\/wp.sigmod.org\/wp-content\/uploads\/2018\/03\/JoseBlakeley-200x300.jpg 200w, https:\/\/wp.sigmod.org\/wp-content\/uploads\/2018\/03\/JoseBlakeley-233x350.jpg 233w, https:\/\/wp.sigmod.org\/wp-content\/uploads\/2018\/03\/JoseBlakeley-47x71.jpg 47w\" sizes=\"auto, (max-width: 177px) 100vw, 177px\" \/><\/p>\n<p>Jos\u00e9 completed his undergraduate studies at Tecnol\u00f3gico de Monterrey, Mexico in 1978. He continued his studies in Computer Science at University of Waterloo, Canada, receiving an MMath degree in 1983 and PhD in 1987. After graduation he joined Indiana University, Bloomington, as a faculty member. He moved to Texas Instruments in 1989 where he worked on the development of an OODB. Jos\u00e9 joined Microsoft in 1994 and spent the remainder of his career in Redmond. His first project was OLE DB (a database access interface). He led the integration of the .NET Common Language Runtime (CLR) into SQL Server during 2003-2005 and was the lead architect of the Data Programmability Group during 2005-2007. His main focus over the last 10 years was building and shipping the SQL Server Parallel Data Warehouse (PDW) appliance (2007-2013) and the Azure Data Lake Analytics cloud service (starting in 2013). Jos\u00e9 has 19 granted patents. Over the years, he has served on many database research program committees, including as VLDB 2004 Industry PC Chair, ICDE 2008 PC Chair and VLDB 2011 General PC Co-Chair. He became an ACM Fellow in 2009.<\/p>\n<h3>PhD in Waterloo: Materialized Views<\/h3>\n<p>Jos\u00e9 arrived at Waterloo to begin work on his MMath degree in September 1981 under the direction of Frank Tompa, completing his coursework requirements and an essay entitled \u201cThe Design of an Electronic Telephone Directory.\u201d He then embarked on his PhD program, choosing to work with both Paul Larson and Frank Tompa on database query processing and, more specifically, on how to maintain materialized views efficiently. His doctoral dissertation answers the following question: given the declaration of a select-project-join (SPJ) view, a database instance D, the corresponding view instance V, and an update U (either a set of tuples I to be inserted into the database, a selection condition S specifying the set of tuples to be deleted from the database, or a selection condition M specifying the set up tuples to be modified and how to modify them), how can V be most efficiently updated to reflect the new state of the database?<\/p>\n<p>Jos\u00e9\u2019s contributions centered on several aspects of this problem:- Under what conditions can U be safely ignored because the specified insertion, deletion, or modification cannot cause any changes to V?<\/p>\n<p>&#8211; If U cannot be ignored, under what conditions can the appropriate updates to V be determined a) without accessing base tables and data in V or b) with accessing data in V but not in base tables?<\/p>\n<p>&#8211; In all cases, what is an efficient algorithm to compute the required updates to V and apply them?<\/p>\n<p>At the time that Jos\u00e9 embarked on this research, others had begun to investigate how to use materialized views to make query processing more efficient, but little work addressed the question of how to incrementally update views after an update. His SIGMOD 1986 paper contains the fundamental algorithms for incremental view maintenance and is one of the classics in the area. His contributions in this area are documented in references [1 \u2013 6]. The techniques used to maintain materialized views in all database systems (including data warehouses) have built upon this work. Today all major commercial systems support materialized views, but the first deployment emerged more than ten years after Jos\u00e9\u2019s pioneering work.<\/p>\n<h3>Professor at Indiana University<\/h3>\n<p>Jos\u00e9 spent two years in academia at Indiana University in Bloomington where he co-advised one PhD student, Anand Deshpande. At that time, Jos\u00e9 worked on two ideas. The first idea was to apply database abstractions (e.g., SQL and ACID transactions) to the design of operating systems. The second idea was a new way to implement nested relational database systems which became an important part of Anand&#8217;s PhD thesis. That idea also inspired work on path indexes and especially XML document databases more than a decade later.<\/p>\n<h3>Texas Instruments: Object Databases<\/h3>\n<p>Object-oriented databases (OODBs aka ODBs) were conceived in the mid-1980s to reduce the <em>impedance mismatch<\/em> between database and programming language data structures. The initial idea for OODBs was to <em>seamlessly <\/em>persist and share programming language data types. At Texas Instruments, a team of researchers explored OODBs to store CAD databases of circuits persistently. Jos\u00e9 joined this project in 1989 and, with his strong relational background, made the observation that persistence and query capabilities were <em>orthogonal<\/em>. That led him to propose that querying over persistent or transient CAD or other programming language data structures be supported. In 1990, Jos\u00e9 developed (prototyped, documented and patented) TI&#8217;s OODB\u2019s query service OQL[X] where X refers to the data structures in a host programming language, e.g., C++. Jos\u00e9 was responsible for the language interface, query optimizer, and set execution runtime [7 \u2013 9].<\/p>\n<p>Subsequently, the work of the TI team also led to the DARPA-funded Open OODB project (1990-1995) which developed the first object-oriented DBMS built as a componentized system. Through two key contributions, the DARPA Open OODB architecture influenced the then-nascent middleware industry that blossomed into service-oriented architectures (SOA), today a mainstay of enterprise computing. First, the team authored an influential technical report in 1989 that was presented at early Object Management Group (OMG) meetings that influenced the OMG Object Management <em>Architecture Guide<\/em>&#8216;s &#8220;Reference Model&#8221;, a bus architecture with services available on the bus [10 \u2013 12]. Second, the DARPA Open OODB modules were each separately documented using a common template, effectively a <em>design pattern<\/em> (four years before that term came into general use via the Gang of Four\u2019s famous text). At OMG, Jos\u00e9 and his colleagues contributed to OMG&#8217;s <em>Object Service Architecture<\/em>, a compendium of middleware design patterns that drove OMG standards related to CORBA Services throughout the 1990s. These service interfaces populated industry\u2019s first widely useful service-oriented architecture [13].<\/p>\n<p>Jos\u00e9&#8217;s OODB work is synthesized in a book chapter that describes query processing in OODBs [14].<\/p>\n<h3>Microsoft: OLE DB \/ ADO<\/h3>\n<p>Jos\u00e9 joined Microsoft in 1994, as an architect to the OLE DB team, where he continued his passion for reducing the impedance mismatch between traditional databases and modern programming languages. Jos\u00e9 authored one of the early papers that formed the team\u2019s strategy and vision, a must read for everyone who was hired into the team: &#8220;<em>Data Access for the Masses through OLE DB<\/em>&#8221; [15]. The overall premise was to allow applications (the masses) to query and reason about data wherever it resided, as opposed to requiring all data to be moved to a traditional database. This fundamental shift to UDA (universal data-access) was both complementary to the traditional view of &#8220;database at the center of the universe&#8221;, and allowed multiple heterogeneous data sources; flat files, sequential files, desktop databases, email, directory services, traditional database, and even the web with ADO integration (ActiveX Data Objects).<\/p>\n<p>Most of the OLE DB and ADO teams knew Jos\u00e9 as the face of data-access (UDA), a key founding father of OLE DB, one who impacted the strategic direction of the team, investments, and customers. Others knew Jos\u00e9 as a key architect who they butted heads with, as he had strong convictions on the technology front, from the specification, to the programming surface (API), all the way to the codebase itself. Others knew Jos\u00e9 as a magnet for new hires, spending a lot of time recruiting the team, convincing them to come to Microsoft, and mentoring them once on board.<\/p>\n<p>Many team members also got to know Jos\u00e9 as a friend who would routinely share fatherly advice. Jos\u00e9 always found the balance between work, making the world a better place, and family life.<\/p>\n<h3>.NET CLR Integration, ADO.NET Entity Frameworks<\/h3>\n<p>After OLE DB, Jos\u00e9 joined the SQL Server engine team as architect for the SQL CLR effort, to host the .NET Common Language Runtime (CLR) in Microsoft SQL Server. With the CLR hosted in Microsoft SQL Server, database developers can author stored procedures, triggers, user-defined functions, user-defined types, and user-defined aggregates in .NET managed code, providing safety and JIT (Just-in-time) compilation for performance [16].<\/p>\n<p>In 2004, Jos\u00e9 came back to extend ADO to the next generation. He joined the ADO.NET group to raise the programming abstraction from relations to entity types in ADO.NET by incorporating Microsoft\u2019s Entity Data Model (EDM). EDM is an extended relational model that treats entities and relationships as first class concepts, a query language for EDM, a comprehensive mapping engine that translates from the conceptual (entity) to the logical (relational) level, and model-driven tools that help define and maintain mappings. Collectively, these services are called the ADO.NET Entity Framework. The entity framework provided conceptual model footing for subsequent data programming efforts like OData and Microsoft Graph API [17]. It is widely used and still under active development as an open source project [20, 21].<\/p>\n<h3>Parallel Data Warehouse<\/h3>\n<p>In 2007 the data warehousing industry was shifting towards appliance solutions where hardware and MPP software would be engineered to work together. Microsoft was considering an acquisition to embrace the trend though there was risk and uncertainty. It was Jos\u00e9\u2019s technical understanding of the technology and his always optimistic outlook that gave us the encouragement to proceed with a deal. Jos\u00e9 dedicated many of the following years of his career to ensure the success of the project.<\/p>\n<p>There was no end to the technical challenges the project had to overcome, and Jos\u00e9 left his positive mark all over the team and the technology. He helped attract talent, and personally spent time with each engineer showing them what great engineering looked like. He influenced technical choices in the core database system but also weighed in on the hardware platform including compute, networking and storage decisions.<\/p>\n<p>Beyond technical contributions, Jos\u00e9 displayed leadership that will be remembered for many years to come. In times of pressure Jos\u00e9 provided a calm perspective, and in times where complacency threatened the quality of deliverables Jos\u00e9 held everyone to the highest standards. Jos\u00e9 sacrificed a significant amount of family time by flying down to Southern California every week for many years to ensure the success of the team and the project. In the end he succeeded, the technology shipped in appliance form known as Parallel Data Warehouse (PDW) and is also the foundation of Microsoft\u2019s Azure Data Warehouse service.<\/p>\n<p>The passion, perseverance, and dedication Jos\u00e9 displayed during these years will have a lasting impact and be remembered for decades to come.<\/p>\n<h3>Data Analytics in the Cloud: Azure Data Lake<\/h3>\n<p>Three disruptions impacted the database and data warehouse industry in the last decade: (a) the cloud, (b) new complex workloads, and (c) open-source systems such as Hadoop. Microsoft had an early response to these disruptions: A system called Cosmos was developed for internal use within Microsoft to build prediction models for Bing, improve service quality for Skype, and analyze the availability of Office 365, among many other scenarios. Cosmos has all the features that are needed to do complex analytics and machine learning on semi-structured data in the file system (e.g., logs). It has support for relational operators and user-defined functions. Furthermore, Cosmos scales to tens of thousands of machines and exa-bytes of data, per cluster. However, Cosmos was not designed from the ground up to host workloads and data from Microsoft customers.<\/p>\n<p>In the last four years, Jos\u00e9 was an architect in Azure Data Lake Analytics team. Azure Data Lake Analytics is the service that brings the core Cosmos technology to Microsoft customers: It is an elastic and fully managed service that allows users to pay-as-they-go. It features a new powerful query language with user-defined functions, called U-SQL. It is fully compliant and secure and like Cosmos scales to large clusters and huge data sizes. Jos\u00e9 was a leader in exploring extensions to U-SQL. Jos\u00e9 was also passionate about the robustness and performance of the system.Until literally his last moments, he worked on a new scheduling algorithm [17] and a new benchmarking framework to study the performance and cost \/ response time &amp; availability trade-offs of managed services like Azure Data Lake Analytics [18, 19]. He was constantly fighting fires and addressing scalability and capacity issues that arise in large-scale systems like Azure Data Lake. Last but not the least, he was a great recruiter and mentor to many people in the team.<\/p>\n<h4>Contributors<\/h4>\n<p>Philip Bernstein, Pedro Celis, Surajit Chaudhuri, Anand Deshpande, David DeWitt, Kent Foster, Cesar Galindo-Legaria, Christian Kleinerman, Donald Kossmann, Per-Ake Larson, David Lomet, Anil Nori, Frank Tompa, Tamer \u00d6zsu, Raghu Ramakrishnan, Michael Rys, Clemens Szyperski, Craig Thompson, Ed Triou, Dirk van Gucht<\/p>\n<p><strong>Footnotes<\/strong><\/p>\n<p>[1] Jos\u00e9 A. Blakeley: \u201cUpdating Materialized Database Views,\u201d PhD Thesis, Department of Computer Science, University of Waterloo, 1987 (joint supervision: Per-\u00c5ke Larson and Frank Tompa).<\/p>\n<p>[2] Jos\u00e9 A. Blakeley, Neil Coburn, and Per-\u00c5ke Larson: Updating Derived Relations: Detecting Irrelevant and Autonomously Computable Updates. VLDB Conference 1986: 457-466.<\/p>\n<p>[3] Jos\u00e9 A. Blakeley, Per-\u00c5ke Larson, and Frank Wm. Tompa: Efficiently Updating Materialized Views. SIGMOD Conference 1986: 61-71.<\/p>\n<p>[4] Frank Wm. Tompa and Jos\u00e9 A. Blakeley: Maintaining materialized views without accessing base data. Inf. Syst. 13(4): 393-406 (1988).<\/p>\n<p>[5] Jos\u00e9 A. Blakeley, Neil Coburn, and Per-\u00c5ke Larson: Updating Derived Relations: Detecting Irrelevant and Autonomously Computable Updates. ACM Trans. Database Syst. 14(3): 369-400 (1989).<\/p>\n<p>[6] Jos\u00e9 A. Blakeley and Nancy L. Martin: Join Index, Materialized View, and Hybrid-Hash Join: A Performance Analysis. ICDE Conference 1990: 256-263.<\/p>\n<p>[7] Jos\u00e9 Blakeley, Craig Thompson: \u201cApparatus and Method for Adding an Associative Query Capability to a Programming Language,\u201d Patent application filed April 1990, issued as U.S. Patent 5,761,493, June 1998, and U.S. Patent 5,826,077, October 1998.<\/p>\n<p>[8] Jos\u00e9 Blakeley, Craig Thompson, Abdulah Alashqur: \u201cStrawman Reference Model for Object Query Languages,\u201d International Journal of Computer Standards and Interfaces, 1991.<\/p>\n<p>[9] Craig Thompson, Jos\u00e9 Blakeley, David Wells: &#8220;Object Query Service,&#8221; OMG Documents 09-44, September 1994.<\/p>\n<p>[10] Craig Thompson, Jos\u00e9 Blakeley, Tom Bannon, John Chen, Tom Ekberg, Steve Ford, Anil Gupta, J. Joseph, Edward Perez, Diana Sparacin, Robert Peterson, Mark Shadowens, Satish Thatte, Chung Wang, David Wells: &#8220;Open Architecture for Object-oriented Database Systems,&#8221; Texas Instruments Technical Report ITL 89-12-01, December 1989. OMG Document 1990\/90-01-06.<\/p>\n<p>[11] William Andreas, Goeff Lewis, Matthew Mathews, Lee Scheffler, R. Soley, Craig Thompson: &#8220;Reference Model,&#8221; Object Management Architecture Guide, OMG Document 1990\/90-09-01.<\/p>\n<p>[12] David Wells, Jos\u00e9 Blakeley, Craig Thompson: &#8220;Architecture of an Open Object-oriented Database Management System,&#8221; IEEE Computer, Special Issue on Object-Oriented Applications: 74-82(1992).<\/p>\n<p>[13] &#8220;OMG Object Services Architecture V8.0,&#8221; OMG, December 1994.<\/p>\n<p>[14] M.Tamer \u00d6zsu and Jos\u00e9 Blakeley, &#8220;Query Optimization and Processing in Object-Oriented Database Systems,&#8221; In Modern Database Management &#8211; Object-Oriented and Multidatabase Technologies, W. Kim (ed.), Addison-Wesley\/ACM Press, 1994, pages 146-174.<\/p>\n<p>[15] Jos\u00e9 Blakeley: &#8220;Data Access for the Masses through OLE DB,&#8221; SIGMOD Conference: 161-172 (1996).<\/p>\n<p>[16] Alazel Acheson, Mason Bendixen, Jos\u00e9 Blakeley, Peter Carlin, Ebru Ersan, Jun Fang, Xiaowei Jiang, Christian Kleinerman, Balaji Rathakrishnan, Gideon Schaller, Beysim Sezgin, Ramachandran Venkatesh, Honggang Zhang: &#8220;Hosting the .NET Runtime in Microsoft SQL Server,&#8221; SIGMOD Conference 2004: 860-865.<\/p>\n<p>[17] Jos\u00e9 Blakeley, S. Muralidhar, Anil Nori: &#8220;The ADO.NET Entity Framework: Making the Conceptual Level Real,&#8221; ER Conference 2006: 552-565.<\/p>\n<p>[18] Zhicheng Yin, Jin Sun, Ming Li, Jaliya Ekanyake, Haibo Lin, Marc Friedman, Jos\u00e9 Blakeley, Clemens Szyperski, Nikhil Devanur: &#8220;Bubble Execution: Resource-aware Reliable Analytics at Cloud Scale,&#8221; to appear in Proceedings of the VLDB 2018.<\/p>\n<p>[19] Umar Farooq Minhas, Jos\u00e9 Blakeley, Donald Kossmann, Raghu Ramakrishnan, Clemens Szyperski: &#8220;Benchmarking Cloud-based Big Data Analytics Services,&#8221; in preparation, 2018.<\/p>\n<p>[20] <a href=\"https:\/\/github.com\/aspnet\/EntityFrameworkCore\" target=\"_blank\" rel=\"noopener noreferrer\">https:\/\/github.com\/aspnet\/EntityFrameworkCore<\/a><br \/>\n[21] <a href=\"https:\/\/github.com\/aspnet\/EntityFramework6\" target=\"_blank\" rel=\"noopener noreferrer\">https:\/\/github.com\/aspnet\/EntityFramework6<\/a><\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Jos\u00e9 Alfredo Blakeley, Partner Architect at Microsoft, passed away on January 7th, 2018. With this &#8220;tribute&#8221;, we would like to honor his many contributions to data management. Jos\u00e9 will be sorely missed as a great scientist, mentor, colleague, and friend. Jos\u00e9 completed his undergraduate studies at Tecnol\u00f3gico de Monterrey, Mexico in 1978. He continued his [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"coauthors":[78,76,63,60,77],"class_list":["post-2261","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"views":1605,"_links":{"self":[{"href":"https:\/\/wp.sigmod.org\/index.php?rest_route=\/wp\/v2\/posts\/2261","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.sigmod.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wp.sigmod.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wp.sigmod.org\/index.php?rest_route=\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.sigmod.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2261"}],"version-history":[{"count":15,"href":"https:\/\/wp.sigmod.org\/index.php?rest_route=\/wp\/v2\/posts\/2261\/revisions"}],"predecessor-version":[{"id":2963,"href":"https:\/\/wp.sigmod.org\/index.php?rest_route=\/wp\/v2\/posts\/2261\/revisions\/2963"}],"wp:attachment":[{"href":"https:\/\/wp.sigmod.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2261"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wp.sigmod.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2261"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wp.sigmod.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2261"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/wp.sigmod.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcoauthors&post=2261"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}