User Tools

Site Tools


retreats:2022fall:abstracts

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
retreats:2022fall:abstracts [2022/11/03 15:28]
kilov
retreats:2022fall:abstracts [2022/11/03 15:37] (current)
kilov
Line 5: Line 5:
   - **Document Organization Three Ways**\\ Despite advances in natural language processing, computer vision, and other techniques that simplify the processing of large, unstructured documents such as PDFs, present-day tools remain difficult to use. Many experts from non-technical domains continue to process large, messy document datasets manually, while others become self-taught programmers. For teams with limited time, budgets, and computing education, this is a heavy burden. Our study assesses the learnability of three categories of programming interaction for document processing: textual, visual, and programming-by-example. We conducted a counterbalanced within-subject study (n=12) in which participants used all three programming paradigms. Our qualitative analysis reveals patterns in their relative benefits, including how participants reported Visual programming paradigms gave them a broader understanding of their data. Our results suggest design opportunities for tools that aim to help domain experts complete programming tasks.\\ \\   - **Document Organization Three Ways**\\ Despite advances in natural language processing, computer vision, and other techniques that simplify the processing of large, unstructured documents such as PDFs, present-day tools remain difficult to use. Many experts from non-technical domains continue to process large, messy document datasets manually, while others become self-taught programmers. For teams with limited time, budgets, and computing education, this is a heavy burden. Our study assesses the learnability of three categories of programming interaction for document processing: textual, visual, and programming-by-example. We conducted a counterbalanced within-subject study (n=12) in which participants used all three programming paradigms. Our qualitative analysis reveals patterns in their relative benefits, including how participants reported Visual programming paradigms gave them a broader understanding of their data. Our results suggest design opportunities for tools that aim to help domain experts complete programming tasks.\\ \\
   - **Exploring the Learnability of Program Synthesizers by Novice Programmers**\\ Tools known as program synthesizers show promise to lighten the burden of programming by automatically writing code for users, but little research has addressed what contributes to and detracts from their learnability by novice programmers. For example:   - **Exploring the Learnability of Program Synthesizers by Novice Programmers**\\ Tools known as program synthesizers show promise to lighten the burden of programming by automatically writing code for users, but little research has addressed what contributes to and detracts from their learnability by novice programmers. For example:
-  * How do synthesizers' user interaction models affect their learnability? +    - Ordered List ItemHow do synthesizers' user interaction models affect their learnability? 
-  What kinds of inputs are least burdensome to learn to provide to the synthesizer? +    What kinds of inputs are least burdensome to learn to provide to the synthesizer? 
-  What common misconceptions do novice programmers demonstrate in their use of program synthesizers? +    What common misconceptions do novice programmers demonstrate in their use of program synthesizers? 
-  We observed novice programmers working with real, released program synthesizers to answer these questions and more. +    We observed novice programmers working with real, released program synthesizers to answer these questions and more.\\ From our analysis, we provide a set of design opportunities to inform the design of future program synthesizers. Our findings have ramifications for the use of program synthesis in data work.\\ \\
-From our analysis, we provide a set of design opportunities to inform the design of future program synthesizers. Our findings have ramifications for the use of program synthesis in data work.\\ \\+
   - **Always-on Visualization Recommendations**\\ Exploratory data science largely happens in computational notebooks with dataframe APIs, such as pandas, that support flexible means to transform, clean, and analyze data. Yet, visually exploring data in dataframes remains tedious, requiring substantial programming effort for visualization and mental effort to determine what analysis to perform next. We propose Lux, an always-on framework for accelerating visual insight discovery in dataframe workflows. When a dataframe is printed, Lux recommends visualizations to provide a quick overview of the patterns and trends and suggest promising analysis directions. Users can tailor recommendations via a lightweight intent language. Lux also leverages scalable data computation techniques to generate recommendations quickly. Lux has been embraced by data science practitioners -- and especially by novice data scientists -- with over 400K downloads and 4.2k stars on Github.\\ \\   - **Always-on Visualization Recommendations**\\ Exploratory data science largely happens in computational notebooks with dataframe APIs, such as pandas, that support flexible means to transform, clean, and analyze data. Yet, visually exploring data in dataframes remains tedious, requiring substantial programming effort for visualization and mental effort to determine what analysis to perform next. We propose Lux, an always-on framework for accelerating visual insight discovery in dataframe workflows. When a dataframe is printed, Lux recommends visualizations to provide a quick overview of the patterns and trends and suggest promising analysis directions. Users can tailor recommendations via a lightweight intent language. Lux also leverages scalable data computation techniques to generate recommendations quickly. Lux has been embraced by data science practitioners -- and especially by novice data scientists -- with over 400K downloads and 4.2k stars on Github.\\ \\
   - **Human-Centered Tools for Reliable Use of Machine Translation**\\ Although machine translation (MT) technology has been rapidly improving, actual user needs for these systems remain relatively poorly understood and, as a result, unmet. For example, current MT systems do not help users understand when they can rely on translations, or when the system has made an error. MT holds great potential to increase access to information and improve social interactions across languages. However, undetected mistranslations can cause serious harm, especially when MT is used in high stakes settings like healthcare. In this talk, I will discuss how we might develop MT tools that provide actionable, useful support for users to understand when translations are reliable and to recover or adapt when they are not. In ongoing work, we are developing tools to improve written cross-lingual communication in medical settings. By combining pre-translated phrases and machine translation we strive to provide clinicians with greater insight into and control over output accuracy when crafting instructions for patients.\\ \\   - **Human-Centered Tools for Reliable Use of Machine Translation**\\ Although machine translation (MT) technology has been rapidly improving, actual user needs for these systems remain relatively poorly understood and, as a result, unmet. For example, current MT systems do not help users understand when they can rely on translations, or when the system has made an error. MT holds great potential to increase access to information and improve social interactions across languages. However, undetected mistranslations can cause serious harm, especially when MT is used in high stakes settings like healthcare. In this talk, I will discuss how we might develop MT tools that provide actionable, useful support for users to understand when translations are reliable and to recover or adapt when they are not. In ongoing work, we are developing tools to improve written cross-lingual communication in medical settings. By combining pre-translated phrases and machine translation we strive to provide clinicians with greater insight into and control over output accuracy when crafting instructions for patients.\\ \\
retreats/2022fall/abstracts.1667514519.txt.gz · Last modified: 2022/11/03 15:28 by kilov